1 Heading

•  PI:  Peng  Yin 

•  Organization:  Harvard  University 

•  ONR  Award  Number:  N00014-13-1-0593 

•  Award  Title:  Kinetic  Self-assembly  of  Complex  Structures  from  DNA  Tiles  and  Bricks 

2  Scientific  and  technical  objectives 

The objective of our work is to engineer synthetic developmental self-assembly, where a synthetic
molecular structure grows isothermally in a kinetically controlled fashion. We combine the structural
insight gleaned from building complex structures from single-stranded DNA bricks with the
kinetic insight on programming assembly pathways using hairpin motifs and strand displacement
interactions. This paradigm is fundamentally different from and conceptually more powerful than
the dominant thermal annealing paradigm for assembling synthetic molecular structures. We are
using this developmental self-assembly framework for triggered isothermal self-assembly of highly
complex single-stranded tile- and brick-based nanostructures. The successful implementation of
this developmental self-assembly framework will introduce a profound paradigm shift in the field
of synthetic self-assembly, and promises numerous technological applications with transformative
impacts. As a parallel and supplemental effort, we develop software tools to automate the design and
in silico testing of these structures.

3  Approach 

We  report  herein  on  the  following  approaches: 

1. Development of fundamental molecular mechanisms that enable the developmental self-assembly
of complex DNA nanostructures. See section 5.1 for details.

2. Development of transformative real-world applications based on dynamic DNA nanotechnology.
See section 5.2 for details.

3. Development of software tools to enable automated design and in silico testing of kinetic DNA
nanosystems. See section 5.3 for details.

4  Concise  accomplishment 

• A Biochemical Nanoscope via Auto-cycling Proximity Recording. Here, we present a microscope-free
“biochemical nanoscopy” method that records sophisticated nanostructure features in
situ for later readout. The method is based on a conceptually novel Auto-cycling Proximity
Recording (APR) mechanism, which continuously and repeatedly produces proximity records
of any nearby pairs of DNA-barcoded probes, at physiological temperature, without destroying
the probes themselves.

• Barcode Extension for Analysis and Reconstruction of Structures (BEARS). We introduce
a  new  method  of  structure  characterization  for  simultaneous  quantification  of  every  strand 
species  incorporated  into  a  structure.  Our  method  uses  barcode  extension  and  next-generation 
DNA  sequencing  to  quantitatively  measure  the  relative  incorporation  of  individual  strands  into 
a  DNA  nanostructure. 

• Eliminating Kinetic Barriers through Sequence Design Allows Rapid Assembly of DNA Origami
at Room Temperature. We use computational sequence design to eliminate prohibitive kinetic
barriers and thus assemble DNA origami at room temperature in as little as five minutes.

• Quantitative superresolution imaging with qPAINT. Here, we use the programmable and specific
binding of dye-labeled DNA probes to count integer numbers of targets. This method,
called quantitative points accumulation in nanoscale topography (qPAINT), works independently
of dye photophysics for robust counting with high precision and accuracy over a wide
dynamic range.

• Optical imaging of individual biomolecules in densely packed clusters. Here, we show that
discrete molecular imaging is possible using DNA-PAINT (points accumulation for imaging
in nanoscale topography) - a super-resolution fluorescence microscopy technique that exploits
programmable transient oligonucleotide hybridization - on synthetic DNA nanostructures.

• DyNAMiC Workbench: An Integrated Development Environment for Dynamic DNA Nanotechnology.
Here, we present software implementing a three “tier” design process: a high-level
visual programming language is used to describe systems, a molecular compiler builds a DNA
implementation and nucleotide sequences are generated and optimized. Additionally, our software
includes tools for analysing and “debugging” the designs in silico, and for
importing/exporting designs to other commonly used software systems.

• Paper-based synthetic gene networks. We present an in vitro paper-based platform that provides
an alternate, versatile venue for synthetic biologists to operate and a much-needed medium
for the safe deployment of engineered gene circuits beyond the lab. We demonstrate this
technology with small-molecule and RNA actuation of genetic switches, rapid prototyping of
complex gene circuits, and programmable in vitro diagnostics, including glucose sensors and
strain-specific Ebola virus sensors.

• Toehold switches: de-novo-designed regulators of gene expression. We report a class of
de-novo-designed prokaryotic riboregulators called toehold switches that activate gene expression
in response to cognate RNAs with arbitrary sequences. Toehold switches provide a high level
of orthogonality and can be forward engineered to provide average dynamic range above 400.

• Complex reconfiguration of DNA nanostructures. We have developed a general structural-
reconfiguration method that utilizes the modularly interconnected architecture of single-stranded
DNA tile and brick structures. By using this method, we reconfigured a two-dimensional
rectangular DNA canvas and a three-dimensional DNA cuboid into diverse prescribed shapes.

• Developmental self-assembly of a DNA tetrahedron. By integrating temporal with spatial
control, here we demonstrate the “developmental” self-assembly of a DNA tetrahedron, where a
prescriptive molecular program orchestrates the kinetic pathways by which DNA molecules
isothermally self-assemble into a well-defined three-dimensional wireframe geometry.

• Multiplexed 3D cellular super-resolution imaging with DNA-PAINT and Exchange-PAINT.
Here we use the transient binding of short fluorescently labeled oligonucleotides (DNA-PAINT,
a variation of point accumulation for imaging in nanoscale topography) for simple and
easy-to-implement multiplexed super-resolution imaging that achieves sub-10-nm spatial resolution
in vitro on synthetic DNA structures.

• Design space for complex DNA structures. We perform a comprehensive study of the design
space for complex DNA structures, using more than 30 distinct motifs derived from single-
stranded tiles. These motifs self-assemble to form structures with diverse strand weaving
patterns and specific geometric properties, such as curvature and twist.

• Isothermal self-assembly of complex DNA structures under diverse and biocompatible conditions.
We describe a DNA self-assembly system that can be tuned to form a complex target
structure isothermally at any prescribed temperature or homogeneous condition within a wide
range. We were able to achieve isothermal assembly between 15 and 69 °C in a predictable
fashion.

5  Expanded  accomplishments 

5.1  Fundamental  Accomplishments  in  Kinetic  Self-Assembly 

5.1.1 Barcode Extension for Analysis and Reconstruction of Structures (BEARS)

Collections of DNA sequences can be rationally designed to self-assemble into predictable
three-dimensional structures. The geometric and functional diversity of DNA nanostructures created to
date has been enhanced by improvements in DNA synthesis and computational design. However,
existing methods for structure characterization typically image the final product or laboriously
determine the presence of individual, labeled strands using gel electrophoresis. Here, we introduce a
new method of structure characterization for simultaneous quantification of every strand species
incorporated into a structure. Our method uses barcode extension and next-generation DNA sequencing
to quantitatively measure the relative incorporation of individual strands into a DNA nanostructure.
By quantifying the relative abundances of distinct DNA species in product and monomer bands,
we can study the influence of geometry and sequence on assembly. We have tested our method using
2D and 3D DNA brick and DNA origami structures. Our method is general and should be extensible
to a wide variety of synthetic DNA structures.
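
As a rough illustration of the quantification step, the sketch below compares per-strand
barcode read counts from a product band with those from a monomer band. The function name,
the input dictionaries, and the particular normalization (read fraction in the product band
divided by read fraction in the monomer band) are assumptions made for illustration; they
are not taken from the BEARS protocol itself.

```python
def relative_incorporation(product_counts, monomer_counts):
    """Return, per strand, (read fraction in product) / (read fraction in monomer).

    Hypothetical normalization: values above 1 suggest a strand is enriched in
    the assembled product; values below 1 suggest it is under-incorporated.
    """
    product_total = sum(product_counts.values())
    monomer_total = sum(monomer_counts.values())
    if product_total == 0 or monomer_total == 0:
        raise ValueError("both bands need at least one read")

    ratios = {}
    for strand, product_reads in product_counts.items():
        monomer_reads = monomer_counts.get(strand, 0)
        if monomer_reads == 0:
            ratios[strand] = None  # ratio is undefined without monomer reads
            continue
        product_fraction = product_reads / product_total
        monomer_fraction = monomer_reads / monomer_total
        ratios[strand] = product_fraction / monomer_fraction
    return ratios
```

Dividing by the per-band totals makes the ratio insensitive to differences in sequencing
depth between the two bands, which is the main reason for this choice of normalization.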

5.1.2 Eliminating Kinetic Barriers through Sequence Design Allows Rapid Assembly of DNA
Origami  at  Room  Temperature 

DNA origami [3] is a robust and programmable technique for making DNA nanostructures. A key use
of DNA origami structures has been as nanoscale breadboards that organize components like small
proteins, nanoparticles, small molecules and other DNA strands. Typical DNA origami assembly
requires that the scaffold strand be annealed with staple strands in the appropriate buffer. The high
temperatures (> 80 °C) used in this process are not always compatible with temperature-sensitive
components or contexts. In response, there have been attempts to fold DNA origami isothermally.
Studies have shown that DNA origami can assemble isothermally if held at temperatures around 60 °C,
or at room temperature by using denaturing agents. However, rapid assembly of DNA origami at room
temperature without the use of denaturing agents, which can be toxic or have other unintended
consequences, has not previously been achieved. We assembled a rectangular DNA origami from a
1000-base-long De Bruijn-derived scaffold sequence and a corresponding staple set. The origami
assembles isothermally in as little as five minutes at temperatures ranging from 25 °C to 50 °C.
Refer to the appendix for more details.
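
The report does not describe how the scaffold was derived from a De Bruijn sequence. As a
minimal sketch, the code below builds a standard De Bruijn sequence over the DNA alphabet,
in which every k-mer occurs exactly once when the sequence is read cyclically, so no
subsequence of that length is repeated. The order k = 5 is an assumption chosen only because
4^5 = 1024 is close to the 1000-base scaffold length; the function name is hypothetical.

```python
def de_bruijn_dna(order, alphabet="ACGT"):
    """Build a cyclic De Bruijn sequence of the given order (Lyndon-word method)."""
    k = len(alphabet)
    work = [0] * (k * order)
    indices = []

    def extend(t, p):
        if t > order:
            # Emit the current Lyndon word if its length divides the order.
            if order % p == 0:
                indices.extend(work[1:p + 1])
            return
        work[t] = work[t - p]
        extend(t + 1, p)
        for symbol in range(work[t - p] + 1, k):
            work[t] = symbol
            extend(t + 1, t)

    extend(1, 1)
    return "".join(alphabet[i] for i in indices)


def cyclic_kmers(sequence, order):
    """List every k-mer of a circular sequence, wrapping around the end."""
    wrapped = sequence + sequence[:order - 1]
    return [wrapped[i:i + order] for i in range(len(sequence))]
```

Calling `de_bruijn_dna(5)` yields a 1024-base sequence, and `cyclic_kmers` can be used to
confirm that all of its 5-mers are distinct.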

5.1.3 Complex reconfiguration of DNA nanostructures

Nucleic acids have been used to create diverse synthetic structural and dynamic systems. Toehold-
mediated strand displacement has enabled the construction of sophisticated circuits, motors, and
molecular computers. Yet it remains challenging to demonstrate complex structural reconfiguration
in which a structure changes from a starting shape to another arbitrarily prescribed shape. To
address this challenge, we have developed a general structural-reconfiguration method that utilizes
the modularly interconnected architecture of single-stranded DNA tile and brick structures. The
removal of one component strand reveals a newly exposed toehold on a neighboring strand, thus
enabling us to remove regions of connected component strands without the need to modify the strands
with predesigned external toeholds. By using this method, we reconfigured a two-dimensional
rectangular DNA canvas into diverse prescribed shapes. We also used this method to reconfigure a
three-dimensional DNA cuboid.
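
The cascading removal described above can be pictured with a toy model. The sketch below
treats the canvas as a grid of strand positions and assumes that a strand becomes removable
once any of its four grid neighbors has been removed, because that removal exposes a toehold.
The grid abstraction, the four-neighbor rule, and the function name are simplifying
assumptions for illustration, not the strand-level design used in the reported experiments.

```python
from collections import deque


def removal_order(region, start):
    """Order in which strands of a connected region can be removed.

    region: set of (row, col) positions selected for removal.
    start:  position whose toehold is assumed to be accessible at the outset.
    Positions in `region` that are not connected to `start` are never reached.
    """
    if start not in region:
        raise ValueError("start must belong to the region")

    order = [start]
    seen = {start}
    queue = deque([start])
    while queue:
        row, col = queue.popleft()
        # Removing this strand exposes toeholds on its grid neighbors.
        for neighbor in ((row - 1, col), (row + 1, col), (row, col - 1), (row, col + 1)):
            if neighbor in region and neighbor not in seen:
                seen.add(neighbor)
                order.append(neighbor)
                queue.append(neighbor)
    return order
```

In this model, a region can be carved out completely only if it is connected, which mirrors
the statement that regions of connected component strands are removed.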

5.1.4  Developmental  self-assembly  of  a  DNA  tetrahedron 

Kinetically  controlled  isothermal  growth  is  fundamental  to  biological  development,  yet  it  remains 
challenging  to  rationally  design  molecular  systems  that  self-assemble  isothermally  into  complex 
geometries  via  prescribed  assembly  and  disassembly  pathways.  By  exploiting  the  programmable 
chemistry  of  base  pairing,  sophisticated  spatial  and  temporal  control  have  been  demonstrated  in 
DNA  self-assembly,  but  largely  as  separate  pursuits.  By  integrating  temporal  with  spatial  control, 
here  we  demonstrate  the  “developmental”  self-assembly  of  a  DNA  tetrahedron,  where  a  prescriptive 
molecular  program  orchestrates  the  kinetic  pathways  by  which  DNA  molecules  isothermally  self- 
assemble  into  a  well-defined  three-dimensional  wireframe  geometry.  In  this  reaction,  nine  DNA 
reactants  initially  coexist  metastably,  but  upon  catalysis  by  a  DNA  initiator  molecule,  navigate  24 
individually  characterizable  intermediate  states  via  prescribed  assembly  pathways,  organized  both 
in  series  and  in  parallel,  to  arrive  at  the  tetrahedral  final  product.  In  contrast  to  previous  work  on 
dynamic DNA nanotechnology, this developmental program coordinates growth of ringed substructures
into a three-dimensional wireframe superstructure, taking a step toward the goal of kinetically
controlled  isothermal  growth  of  complex  three-dimensional  geometries. 

5.1.5  Design  space  for  complex  DNA  structures 

Nucleic  acids  have  emerged  as  effective  materials  for  assembling  complex  nanoscale  structures.  To 
tailor  the  structures  to  function  optimally  for  particular  applications,  a  broad  structural  design  space 
is  desired.  Despite  the  many  discrete  and  extended  structures  demonstrated  in  the  past  few  decades, 
the design space remains to be fully explored. In particular, the complex finite-sized structures
produced to date have been typically based on a small number of structural motifs. Here, we perform
a comprehensive study of the design space for complex DNA structures, using more than 30
distinct motifs derived from single-stranded tiles. These motifs self-assemble to form structures with
diverse strand weaving patterns and specific geometric properties, such as curvature and twist. We
performed a systematic study to control and characterize the curvature of the structures, and
constructed a flat structure with a corrugated strand pattern. The work here reveals the breadth of the
design space for complex DNA nanostructures.

5.1.6  Isothermal  self-assembly  of  complex  DNA  structures  under  diverse  and  biocompatible 
conditions 

Nucleic  acid  nanotechnology  has  enabled  researchers  to  construct  a  wide  range  of  multidimensional 
structures  in  vitro.  Until  recently,  most  DNA-based  structures  were  assembled  by  thermal  annealing 
using  high  magnesium  concentrations  and  non-physiological  environments.  Here,  we  describe  a 
DNA  self-assembly  system  that  can  be  tuned  to  form  a  complex  target  structure  isothermally  at  any 
prescribed  temperature  or  homogeneous  condition  within  a  wide  range.  We  were  able  to  achieve 
isothermal assembly between 15 and 69 °C in a predictable fashion by altering the strength of strand-
strand interactions in several different ways, for example, domain length, GC content, and linker
regions between domains. We also observed the assembly of certain structures under biocompatible
conditions,  that  is,  at  physiological  pH,  temperature,  and  salinity  in  the  presence  of  the  molecular 
crowding  agent  polyethylene  glycol  (PEG)  mimicking  the  cellular  environment.  This  represents  an 
important  step  toward  the  self-assembly  of  geometrically  precise  DNA  or  RNA  structures  in  vivo. 
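
To make the roles of domain length and GC content concrete, the sketch below computes the GC
fraction of a binding domain and a rule-of-thumb melting temperature using the Wallace rule
(2 °C per A/T base plus 4 °C per G/C base). The Wallace rule is a coarse estimate for short
oligonucleotides and is used here only as an assumption for illustration; it is not the
model used to design the reported structures, and the function names are hypothetical.

```python
def gc_content(domain):
    """Fraction of G and C bases in a DNA domain."""
    domain = domain.upper()
    if not domain:
        raise ValueError("domain must not be empty")
    return (domain.count("G") + domain.count("C")) / len(domain)


def wallace_tm_celsius(domain):
    """Rule-of-thumb melting temperature: 2 * (A + T) + 4 * (G + C)."""
    domain = domain.upper()
    at_bases = domain.count("A") + domain.count("T")
    gc_bases = domain.count("G") + domain.count("C")
    return 2 * at_bases + 4 * gc_bases
```

Under this estimate, lengthening a domain or raising its GC content both increase the
predicted melting temperature, which is the qualitative trend exploited when tuning the
assembly temperature.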

5.1.7  A  Biochemical  Nanoscope  via  Auto-cycling  Proximity  Recording 

Analysis of the spatial arrangement of molecular features enables the engineering of synthetic
nanostructures and the understanding of natural ones. Microscopy methods can provide direct
visualization of complex spatial features but typically require sophisticated equipment with limited
throughput. Here, we present a microscope-free “biochemical nanoscopy” method that records sophisticated
nanostructure features in situ for later readout. The method is based on a conceptually novel
Auto-cycling Proximity Recording (APR) mechanism, which continuously and repeatedly produces
proximity records of any nearby pairs of DNA-barcoded probes, at physiological temperature, without
destroying the probes themselves. We demonstrate the proximity and auto-cycling characteristics,
and then apply the system as a biochemical nanoscope to accurately decode various spatial
arrangements of 7 unique probes in homogeneous test samples. We further demonstrate the ability of APR
to re-sample the same dynamic system repeatedly. Future development is expected to enable
readout with massively parallel sequencing methods, creating a truly single-molecule technique. Refer
to the appendix for more details.

5.2  Applications  of  Kinetic  Self-Assembly 

5.2.1  Quantitative  superresolution  imaging  with  qPAINT 

Counting  molecules  in  complexes  is  challenging,  even  with  super-resolution  microscopy.  Here,  we 
use  the  programmable  and  specific  binding  of  dye-labeled  DNA  probes  to  count  integer  numbers  of 
targets. This method, called quantitative points accumulation in nanoscale topography (qPAINT),
works  independently  of  dye  photophysics  for  robust  counting  with  high  precision  and  accuracy  over 
a  wide  dynamic  range.  qPAINT  was  benchmarked  on  DNA  nanostructures  and  demonstrated  for 
cellular  applications  by  quantifying  proteins  in  situ  and  the  number  of  single-molecule  FISH  probes 
bound  to  an  mRNA  target. 
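
The counting idea can be sketched with a simple binding-kinetics model. The code below
assumes that each target binds imager strands independently with a second-order on-rate, so
that the mean dark time between binding events is inversely proportional to the number of
targets: n = 1 / (k_on * c * tau_dark). This relation, the parameter names, and the
single-site calibration variant are assumptions for illustration and are not spelled out in
this report.

```python
def estimate_target_count(mean_dark_time_s, k_on_per_molar_s, imager_conc_molar):
    """Estimate the number of binding sites from the mean dark time.

    Assumed model: n = 1 / (k_on * c * tau_dark), with k_on in 1/(M*s),
    c in M and tau_dark in s.
    """
    if min(mean_dark_time_s, k_on_per_molar_s, imager_conc_molar) <= 0:
        raise ValueError("all inputs must be positive")
    return 1.0 / (k_on_per_molar_s * imager_conc_molar * mean_dark_time_s)


def calibrated_target_count(dark_time_target_s, dark_time_single_site_s):
    """Same estimate, using the dark time of a known single site as calibration."""
    if dark_time_target_s <= 0 or dark_time_single_site_s <= 0:
        raise ValueError("dark times must be positive")
    return dark_time_single_site_s / dark_time_target_s
```

Because the estimate depends only on binding kinetics, it does not rely on how long or how
brightly an individual dye emits, which is consistent with the claim that the method works
independently of dye photophysics.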

5.2.2  Optical  imaging  of  individual  biomolecules  in  densely  packed  clusters 

Recent advances in fluorescence super-resolution microscopy have allowed subcellular features and
synthetic nanostructures down to 10-20 nm in size to be imaged. However, the direct optical
observation of individual molecular targets (~5 nm) in a densely packed biomolecular cluster remains
a challenge. Here, we show that such discrete molecular imaging is possible using DNA-PAINT
(points accumulation for imaging in nanoscale topography) - a super-resolution fluorescence
microscopy technique that exploits programmable transient oligonucleotide hybridization - on synthetic
DNA nanostructures. We examined the effects of a high photon count, high blinking statistics and an
appropriate blinking duty cycle on imaging quality, and developed a software-based drift correction
method that achieves < 1 nm residual drift (root mean squared) over hours. This allowed us to image
a densely packed triangular lattice pattern with ~5 nm point-to-point distance and to analyse the
DNA origami structural offset with angstrom-level precision (2 Å) from single-molecule studies.
By combining the approach with multiplexed Exchange-PAINT imaging, we further demonstrated
an optical nanodisplay with 5 × 5 nm pixel size and three distinct colours with < 1 nm cross-channel
registration accuracy.

5.2.3  Paper-based  synthetic  gene  networks 

Synthetic  gene  networks  have  wide-ranging  uses  in  reprogramming  and  rewiring  organisms.  To 
date,  there  has  not  been  a  way  to  harness  the  vast  potential  of  these  networks  beyond  the  constraints 
of  a  laboratory  or  in  vivo  environment.  Here,  we  present  an  in  vitro  paper-based  platform  that 
provides  an  alternate,  versatile  venue  for  synthetic  biologists  to  operate  and  a  much-needed  medium 
for  the  safe  deployment  of  engineered  gene  circuits  beyond  the  lab.  Commercially  available  cell- 
free  systems  are  freeze  dried  onto  paper,  enabling  the  inexpensive,  sterile,  and  abiotic  distribution  of 
synthetic-biology-based  technologies  for  the  clinic,  global  health,  industry,  research,  and  education. 
For  field  use,  we  create  circuits  with  colorimetric  outputs  for  detection  by  eye  and  fabricate  a  low- 
cost,  electronic  optical  interface.  We  demonstrate  this  technology  with  small-molecule  and  RNA 
actuation  of  genetic  switches,  rapid  prototyping  of  complex  gene  circuits,  and  programmable  in 
vitro  diagnostics,  including  glucose  sensors  and  strain-specific  Ebola  virus  sensors. 

5.2.4  Toehold  switches:  de-novo-designed  regulators  of  gene  expression 

Efforts  to  construct  synthetic  networks  in  living  cells  have  been  hindered  by  the  limited  number  of 
regulatory  components  that  provide  wide  dynamic  range  and  low  crosstalk.  Here,  we  report  a  class 
of de-novo-designed prokaryotic riboregulators called toehold switches that activate gene expression
in response to cognate RNAs with arbitrary sequences. Toehold switches provide a high level
of orthogonality and can be forward engineered to provide average dynamic range above 400. We
show that switches can be integrated into the genome to regulate endogenous genes and use them
as sensors that respond to endogenous RNAs. We exploit the orthogonality of toehold switches to
regulate 12 genes independently and to construct a genetic circuit that evaluates 4-input AND logic.
Toehold switches, with their wide dynamic range, orthogonality, and programmability, represent a
versatile and powerful platform for regulation of translation, offering diverse applications in
molecular biology, synthetic biology, and biotechnology.

5.2.5 Multiplexed 3D cellular super-resolution imaging with DNA-PAINT and Exchange-
PAINT 

Super-resolution  fluorescence  microscopy  is  a  powerful  tool  for  biological  research,  but  obtaining 
multiplexed  images  for  a  large  number  of  distinct  target  species  remains  challenging.  Here  we 
use the transient binding of short fluorescently labeled oligonucleotides (DNA-PAINT,
a variation of point accumulation for imaging in nanoscale topography) for simple and easy-to-
implement multiplexed super-resolution imaging that achieves sub-10-nm spatial resolution in vitro
on synthetic DNA structures. We also report a multiplexing approach (Exchange-PAINT)
that allows sequential imaging of multiple targets using only a single dye and a single laser source.
We experimentally demonstrate ten-color super-resolution imaging in vitro on synthetic DNA
structures as well as four-color two-dimensional (2D) imaging and three-color 3D imaging of
proteins in fixed cells.

5.3  Software  Efforts  in  Kinetic  Self-Assembly 

5.3.1  DyNAMiC  Workbench:  An  Integrated  Development  Environment  for  Dynamic  DNA 
Nanotechnology 

Dynamic DNA nanotechnology provides a promising avenue for implementing sophisticated assembly
processes, mechanical behaviours, sensing and computation at the nanoscale. However, design
of these systems is complex and error-prone, because the need to control the kinetic pathway of a
system greatly increases the number of design constraints and possible failure modes for the
system. Previous tools have automated some parts of the design workflow, but an integrated solution
is lacking. Here, we present software implementing a three “tier” design process: a high-level
visual programming language is used to describe systems, a molecular compiler builds a DNA
implementation and nucleotide sequences are generated and optimized. Additionally, our software
includes tools for analysing and “debugging” the designs in silico, and for importing/exporting
designs to other commonly used software systems. The software we present is built on many existing
pieces of software, but is integrated into a single package, accessible using a Web-based interface
at http://molecular-systems.net/workbench. We hope that the deep integration between tools and the
flexibility of this design process will lead to better experimental results, fewer experimental design
iterations and the development of more complex DNA nanosystems.

6  Major  problems/issues  (if  any) 

None. 

7 Technology transfer

See  Sect.  9.4  "Patents". 

8  Foreign  collaborations  and  supported  foreign  nationals 

Foreign Nationals: Nikhil Gopalkrishnan, Alexander Green, Dongran Han, Ralf Jungmann, Jie
Shen, Yonggang Ke, Feng Xuan, Mingjie Dai, Johannes Wohrstein, Maximilian Strauss and Juanita
Lara

Foreign  Collaborations:  N/A 

9  Productivity 

9.1  Refereed  journal  article 

1. Thomas E. Schaus, Sungwook Woo, Feng Xuan, Xi Chen and Peng Yin. A Biochemical
Nanoscope via Auto-cycling Proximity Recording. Submitted.

2. Cameron Myhrvold, Michael Baym, Nikita Hanikel, Luvena Ong, Jonathan Gootenberg and
Peng Yin. Barcode Extension for Analysis and Reconstruction of Structures (BEARS). In
preparation.

3. Nikhil Gopalkrishnan, Thomas Schaus and Peng Yin. Eliminating Kinetic Barriers through
Sequence Design Allows Rapid Assembly of DNA Origami at Room Temperature. In preparation.

4. R. Jungmann, M. S. Avendano, M. Dai, J. B. Woehrstein, S. S. Agasti, Z. Feiger, A. Rodal
and P. Yin. Quantitative super-resolution imaging with qPAINT. Nature Methods, 13, 439-442
(2016).

5. M. Dai, R. Jungmann and P. Yin. Optical imaging of individual biomolecules in densely
packed clusters. Nature Nanotechnology (2016).

6. C. Grun, J. Werfel, D. Y. Zhang, and P. Yin. DyNAMiC Workbench: an integrated development
environment for dynamic DNA nanotechnology. J. R. Soc. Interface, 12: 20150580 (2015).

7. K. Pardee, A. A. Green, T. Ferrante, E. Cameron, A. DaleyKeyser, P. Yin, and J.J. Collins.
Paper-based synthetic gene networks. Cell 159: 940-954 (2014).

8. A.A. Green, P.A. Silver, J.J. Collins, P. Yin. Toehold switches: de-novo-designed regulators
of gene expression. Cell 159: 925-939 (2014).

9. B. Wei, L.L. Ong, J. Chen, A. Jaffe, P. Yin. Complex reconfiguration of DNA nanostructures.
Angewandte Chemie 126: 7605-7609 (2014).

10. J. Sadowski, C.R. Calvert, D.Y. Zhang, N.A. Pierce, P. Yin. Developmental self-assembly of a
DNA tetrahedron. ACS Nano DOI: 10.1021/nn4038223 (2014).

11. R. Jungmann, M.S. Avendano, J.B. Woehrstein, M. Dai, W.M. Shih, P. Yin. Multiplexed 3D
cellular super-resolution imaging with DNA-PAINT and Exchange-PAINT. Nature Methods 11 (3):
313-318 (2014).

12.  B.  Wei,  M.  Dai,  C.  Myhrvold,  Y.  Ke,  R.  Jungmann,  P.  Yin.  Design  space  for  complex  DNA 
structures.  J.  Am.  Chem.  Soc.  135  (48):  18080-8  (2013) 

13.  C.  Myhrvold,  M.  Dai,  P.A.  Silver,  P.  Yin.  Isothermal  self-assembly  of  complex  DNA  structures 
under  diverse  and  biocompatible  conditions.  Nano  Letters  13(9):  4242-8  (2013) 

9.2  Non-refereed  journal  article 

None. 


9.3  Workshops  and  conferences 

1.  7th  international  symposium  on  Bioanalysis,  Biomedical  Engineering  and  Nanotechnology 
(ISBBN  2016),  Changsha,  China,  May  27th,  2016. 

2.  3rd  Materials  Beyond  Symposium,  Shanghai,  China,  May  26th,  2016. 

3. Worcester annual nanotech symposium (NanoWorcester), Worcester, MA, Apr. 14th, 2016.

4.  Workshop  on  10  years  of  DNA  origami,  Pasadena,  CA,  March  14th,  2016. 

5. Biophysical Society 60th Annual Meeting, Los Angeles, CA, Feb. 27th, 2016.

6.  Department  of  Bioengineering,  Rice  University,  Houston,  TX,  Feb.  9th,  2016. 

7. Amgen, Inc., Cambridge, MA, Nov. 10th, 2015.

8.  Molecular  Foundry,  Lawrence  Berkeley  National  Laboratory,  Berkeley,  CA,  Nov.  3rd,  2015. 

9.  Tenth  International  Meeting  of  Institute  for  Translational  Medicine  and  Therapeutics  at  Penn, 
Pennsylvania, PA, Oct. 11th, 2015.

10.  Physical  Sciences  Symposia-2015  on  Quantum  Science  Symposium  and  Crystal/Graphene 
Science Symposium, Cambridge, MA, Sep. 22nd, 2015.

11.  2nd  ACM  International  Conference  on  Nanoscale  Computing  and  Communication,  Boston, 
MA,  Sep.  21st,  2015  [Keynote]. 

12. DARPA Workshop on Transient Materials, Arlington, VA, Sep. 2nd, 2015.

13.  The  Second  Blavatnik  Science  Symposium,  New  York  City,  NY,  Aug.  6th,  2015. 

14. The Eleventh International Fab Lab Conference (Fab11), Boston, MA, Aug. 5th, 2015.

15.  The  Third  Annual  Workshop  on  Micro-  and  Nanotechnologies  in  Medicine,  Cambridge,  MA, 
July  28th,  2015. 

16. Gordon Conference on Nucleosides, Nucleotides and Oligonucleotides, Newport, RI, July 1st,
2015. 

17. Gordon Conference on Synthetic Biology, Newry, ME, June 30th, 2015.

18. Albany 2015: The 19th Conversation, Albany, NY, June 11th, 2015.

19. ECI Conference on Photonics for Biology, Medicine and Surgery, Vail, CO, June 7th, 2015.

20.  1st  Synthetic  Biology  Young  Scholar  Forum  (SynBioYSF),  Beijing,  May  23rd,  2015  [Keynote]. 

21.  College  of  Chemistry  and  Molecular  Engineering,  Beijing,  May  22nd,  2015. 

22. Yale Chemical Biology Symposium, New Haven, CT, May 15th, 2015.

23.  Department  of  Materials  Science  and  Engineering,  Northwestern  University,  Evanston,  IL, 
May 12th, 2015.

24.  First  Active  Matter  Summit,  Cambridge,  MA,  Apr.  24,  2015. 

25.  Harvard  Chinese  Life  Sciences  Annual  Research  Symposium,  Boston,  MA,  March  28,  2015. 

26. SPIE Photonics West BiOS, San Francisco, CA, Feb. 7th, 2015 [Keynote].

27. Frontiers in Quantitative Biology Seminar Series, Stanford University, Stanford, CA, Jan. 15th,
2015. 

28.  Joint  seminar  for  the  Graduate  Programs  in  Bioengineering  and  Pharmaceutical  Sciences  and 
Pharmacogenomics, UCSF, San Francisco, CA, Jan. 13th, 2015.

29. Department of Pathology, Brigham and Women's Hospital, Boston, Dec. 16th, 2014.

30.  IEEE  EMBS  Micro  and  Nanotechnology  in  Medicine  Conference,  Oahu,  HI,  Dec.  9th,  2014. 

31.  Sino-US  Synthetic  Biology  Workshop,  Tianjin,  China,  Dec.  6th,  2014. 

32. School of Life Sciences, Tsinghua University, Beijing, China, Dec. 5th, 2014.

33.  Biodynamic  Optical  Imaging  Center,  Peking  University,  Beijing,  China,  Dec.  5th,  2014. 

34.  Fall  Symposium  of  the  New  England  Society  for  Microscopy,  Cambridge,  MA,  Nov.  20th, 
2014. 

35. Department of Physics, Columbia University, New York, NY, Nov. 10th, 2014.

36.  Department  of  Materials  Science  and  Engineering,  Boise  State  University,  Boise,  ID,  Nov. 
7th,  2014. 

37. Department of Chemical and Biomolecular Engineering, Lehigh University, Bethlehem, PA,
Oct. 29th, 2014.

38.  Naval  Research  Laboratory,  Washington,  DC,  Oct.  20th,  2014. 



39. Fourth U.S.-China Symposium on Nanobiology and Nanomedicine, Washington, DC, Oct.
17th, 2014.

40. Twenty Ninth Chinese Chemical Society Annual Conference, Beijing, China, Aug. 5th, 2014.

41. EITA-New Media and Bio, Cambridge, MA, July 31st, 2014.

42. Second Annual Workshop on Micro- and Nanotechnologies in Medicine, Cambridge, MA,
July 31st, 2014.

43. SEED2014 (Synthetic Biology: Engineering, Evolution and Design), Manhattan Beach, CA,
July 14th, 2014 [Young Investigator Award Lecture].

44. Blavatnik Science Symposium, New York City, NY, July 8th, 2014.

45. Seventh World Congress on Biomechanics (DNA mechanics and assembly session), Boston,
MA, July 6th, 2014.

46. Seventh World Congress on Biomechanics (Nucleic acid nanostructures session), Boston, MA,
July 6th, 2014.

47.  BIO  International  Convention,  San  Diego,  CA,  June  25th,  2014. 

48.  International  Workshop  on  DNA-Based  Nanotechnology:  Digital  Chemistry  (DNATEC14), 
Dresden,  Germany,  May  5th,  2014. 

49. Oncology Research Seminars, Koch Institute for Integrative Cancer Research, MIT,
Cambridge, MA, Apr. 28th, 2014.

50. Foundation of Nanosciences (FNANO2014), Snowbird, Apr. 16th, 2014.

51. Biological Imaging Meeting, Northwestern University, Evanston, March 28th, 2014.

52. National Academy of Sciences Frontiers of Engineering and Sciences Program, Rio de
Janeiro, Brazil, March 17th, 2014.

53.  Biophysics  seminar,  MIT,  Cambridge,  MA,  Feb.  12th,  2014. 

54. NSF Nanoscale Science and Engineering Grantees Conference: Current and Future Trends in
Nanotechnology and Environment and Nanomanufacturing, Arlington, VA, Dec. 6th, 2013.

55. The Nano Area of Advance at Chalmers University of Technology, Goteborg, Sweden, Nov.
28th, 2013.

56. Fourth International Congress on Biotechnology and Genomics, Monterrey, Mexico, Nov.
21st, 2013.

57.  NIH  Common  Fund  High-Risk  High-Reward  Symposium,  Bethesda,  MD,  Nov.  18th,  2013. 

58. Department of Chemistry and Biotechnology, Tokyo University, Tokyo, Japan, Oct. 30th,
2013.

59. Annual Meeting of Chem-Bio Informatics Society, Tokyo, Japan, Oct. 30th, 2013.

60. Department of Chemistry, Kyoto University, Kyoto, Japan, Oct. 29th, 2013.

61. Annual Meeting of the Biophysical Society of Japan, Kyoto, Japan, Oct. 29th, 2013.

62. U.S. Army Research Office Workshop: Challenges and Opportunities in Nanomanufacturing,
Raleigh, NC, Oct. 1st, 2013.

63. International Conference on DNA Computing and Molecular Programming, Tempe, Sep.
27th, 2013.

64. Symposium on Advanced Nano/Biosystems: Design, Fabrication, and Characterization,
University of Illinois at Urbana-Champaign, Urbana, IL, Sep. 25th, 2013.

65. The Ninth International Fab Lab Conference, Yokohama, Japan, Aug. 21st, 2013.

66. NSF Workshop on BioMEMS and Tissue Engineering, Cambridge, July 30th, 2013.

67. NSF Workshop: Advanced Biomanufacturing, Arlington, VA, July 14, 2013.

68. The Sixth International Meeting on Synthetic Biology (SB 6.0), London, England, July 10th,
2013.

69. Sino-US Synthetic Biology Workshop, Tianjin, China, June 21st, 2013.

70. Fourth International Wyss Symposium: Nanotherapeutics and Diagnostics, Boston, June 6th,
2013.

9.4  Patents 

• Yin, Peng; Jungmann, Ralf; Dai, Mingjie; Avendano, Maier; Wohrstein, Johannes. Quantitative
DNA-Based Super-Resolution Imaging US 14/908,333. Pending

• Yin, Peng; Sun, Wei; Shen, Jie. Using DNA Nano Structure as Reactive Ion Etching Mask for
Nanoelectronics Fabrication PCT/US 15/32198. Pending

• Yin, Peng; Agasti, Sarit; Chen, Xi; Jungmann, Ralf. High Throughput and Highly Multiplexed
Imaging with Programmable DNA Probe US 15/108,911. Pending

• Yin, Peng; Han, Dongran; Myhrvold, Cameron. Single-Stranded DNA Nanostructures PCT/US 16/20893.
Pending

• Yin, Peng; Chen, Xi; Schaus, Thomas. Microscope-Free Imaging PCT/US 16/15503. Pending

• Yin, Peng; Agasti, Sarit; Gopalkrishnan, Nikhil; Yaghi, Omar. Target Detection Using Barcode
Conjugates and Nucleic Acid Nanoarrays 62/264,821. Pending

• Yin, Peng; Chen, Xi; Zhang, Mengmeng. Nucleic Acid Retro-Activated Primers PCT/US 16/25480.
Pending

• Yin, Peng; Silverberg, Jesse. Molecular Atlas Project - Software and Netware. Disclosure

• Yin, Peng; Gutierrez, Juanita Maira Lara; Amado, Maier S. Avendano. Proximity PAINT:
DNA-Based Super Resolution Imaging of Protein-Protein Interactions 62/202,327. Pending

• Yin, Peng; Jungmann, Ralf; Wohrstein, Johannes. Kinetic Barcoding. Disclosure

• Yin, Peng; Ong, Luvena; Simmel, Friedrich; Scheible, Max. Compact DNA Cubes. Disclosure

• Yin, Peng; Kim, Jongmin. Synthetic biological circuits for transient materials 62/213,217.
Pending

• Yin, Peng; Kishi, Jocelyn; Schaus, Thomas; Xuan, Feng; Gopalkrishnan, Nikhil; Woo,
Sungwook. Primer exchange reactions (PER) 62/296,310; 62/299,206. Pending

• Yin, Peng; Beliveau, Brian; Kishi, Jocelyn. Probe Miner: A rapid, genome-scale oligo FISH
probe discovery pipeline. Disclosure

• Yin, Peng; Schaus, Thomas; Gopalkrishnan, Nikhil. Sequence design for efficient assembly
of nucleic acid structures 62/321,146. Pending

• Yin, Peng; Xuan, Feng; Dai, Mingjie; Chen, Xi. Conditional Primer Extension for
Single-Molecule Protein Detection 62/342,401. Pending

• Yin, Peng; Kishi, Jocelyn; Xuan, Feng. Primer Exchange Reactions (PER) for Isothermal
PCR 62/342,295. Pending

9.5  Awards/Honors 

• World Economic Forum Young Scientist Award, 2015. Press:
https://agenda.weforum.org/news/world-economic-forum-honours-its-2015-young-scientists-community-at-annual-meeting-of-the-new-champions/

• Blavatnik National Awards for Young Scientists, Finalist, 2015. Press:
http://www.prnewswire.com/news-releases/blavatnik-national-awards-for-young-scientists-announce-2015-finalists-300087851.html

• World Economic Forum Young Scientist Award, 2014. Press: http://www.weforum.org/news/
new-champions

• Inaugural ACS Synthetic Biology Young Scientist Award, 2014. Press:
http://wyss.harvard.edu/viewpage/540/

• Blavatnik National Awards for Young Scientists, Finalist, 2014. Press:
http://blavatnikawards.org/news/items/blavatnik-national-awards-announce-30-finalists/

• NIH Transformative Research Award, 2013. Press:
http://wyss.harvard.edu/viewPressrelease/126/

• NSF Expedition in Computing Award, 2013. Press:
http://www.nsf.gov/news/news_summ.jsp?cntn_id=128886


10  Award  participants 

Military  Personnel:  N/A 

Salary  Support:  Peng  Yin,  Nikhil  Gopalkrishnan,  Alexander  Green,  Dongran  Han,  Ralf  Jungmann, 
Jie  Shen,  Yonggang  Ke,  Feng  Xuan,  Luvena  Ong,  Mingjie  Dai,  Cameron  Myhrvold,  John  Paul 
Sadowski, Johannes Wohrstein, Maximilian Strauss, Juanita Lara and Anita Chandrahas.


11  Appendix 

The  relevant  reprints,  preprints  and  technical  reports  are  included  in  this  appendix. 

• R. Jungmann†, M. S. Avendaño†, M. Dai, J. B. Woehrstein, S. S. Agasti, Z. Feiger, A. Rodal and
2014

• A.A. Green, P.A. Silver, J.J. Collins, P. Yin. Toehold switches: de-novo-designed regulators of
gene expression. Cell 159: 925-939 (2014).

•  B.  Wei,  L.L.  Ong,  J.  Chen,  A.  Jaffe,  P.  Yin.  Complex  reconfiguration  of  DNA  nanostructures. 
Angewandte  Chemie  126:7605-7609  (2014). 

•  J.  Sadowski,  C.R.  Calvert,  D.Y.  Zhang,  N.A.  Pierce,  P.  Yin.  Developmental  self-assembly  of  a 
DNA  tetrahedron.  ACS  Nano  DOI:  10.1021/nn4038223  (2014). 

• R. Jungmann†, M.S. Avendaño†, J.B. Woehrstein†, M. Dai, W.M. Shih, P. Yin. Multiplexed 3D

A Biochemical Nanoscope via Auto-cycling Proximity Recording

Abstract. Analysis of the spatial arrangement of molecular features enables the engineering of synthetic nanostructures and the
understanding of natural ones. Microscopy methods can provide direct visualization of complex spatial features but typically require
sophisticated equipment with limited throughput. Here, we present a microscope-free “biochemical nanoscopy” method that records
sophisticated nanostructure features in situ for later readout. The method is based on a conceptually novel Auto-cycling Proximity
Recording (APR) mechanism, which continuously and repeatedly produces proximity records of any nearby pairs of DNA-barcoded
probes, at physiological temperature, without destroying the probes themselves. We demonstrate the proximity and auto-cycling
characteristics, and then apply the system as a biochemical nanoscope to accurately decode various spatial arrangements of 7 unique
probes in homogeneous test samples. We further demonstrate the ability of APR to re-sample the same dynamic system repeatedly.
Future development is expected to enable readout with massively parallel sequencing methods, creating a truly single-molecule
technique.


A foundational tool for studying nanostructure

Spatial organization is fundamental to the function of synthetic1-5 and biological6-8
nanostructures. The organization of such nanostructures can be described as a set of
pairwise proximities between molecular components. Single-molecule methods to study
proximity and organization must examine individual nanostructures with molecular-scale
precision, and are foundational to advancing nanoscience.9

Though limited to multiplexing a modest number of simultaneous species, direct
visualization by electron, atomic force, and optical microscopy has identified individual
macromolecular associations in synthetic4-10 and biological11-13 systems, and achieves
molecular resolution in controlled environments.14 Resonance energy transfer (FRET)
techniques further allow for dynamic measurement of pairwise proximity in solution.15,16
Complementary to the sophisticated microscopy platforms are biochemical techniques that
have long enabled molecular-scale precision, but typically make only ensemble measurements
of simple pairwise interactions. Techniques such as Yeast Two-Hybrid17-19 and related20
assays, affinity purification / mass spectroscopy,21 and Co-Immunoprecipitation22 are common
in elucidating protein interaction networks, and are relatively easily parallelized,
executed, and even automated.

Barcoding with DNA has recently given biochemical techniques the potential for enormous
multiplexing. Proximity Ligation (PLA)23 or Extension Assays (PEA),24 which log the
co-localization of two probes by ligating or extending probe-bound DNA sequences, now
routinely multiplex ~100 signals with orthogonal sequences. The information content of DNA
itself is much larger, however. There are 4^N combinations of N nucleotides, enabling
1 million-plex with only 10-nucleotide strings (4^10 = 1,048,576). The destructive nature
of PLA and PEA, wherein only a single association can be measured for each probe pair,
limits the utility of unique barcoding in these methods; it is possible to measure pairings
uniquely, but not to connect those unique pairs into large structures on the single
molecule level.
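
The barcode-capacity arithmetic above can be checked with a few lines of Python. This is only an illustrative sketch: the function name `barcode_capacity` and the `alphabet_size` parameter are hypothetical names introduced here, not part of the reported work.

```python
def barcode_capacity(n_nucleotides: int, alphabet_size: int = 4) -> int:
    """Number of distinct sequences of a given length (alphabet_size ** N)."""
    return alphabet_size ** n_nucleotides


if __name__ == "__main__":
    # Compare a few barcode lengths for the full four-letter DNA alphabet.
    for n in (4, 8, 10):
        print(f"{n} nt -> {barcode_capacity(n):,} possible barcodes")
```

The count is an upper bound on sequence space; in practice, usable barcode sets would presumably be smaller once orthogonality constraints are applied.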

Here, we present a novel biochemical interrogation technique, termed “Auto-cycling
Proximity Recording” (APR), for the continuous generation of molecular records of target
proximity (Fig. 1). In contrast to existing methods, proximity data are generated
continuously and repeatedly, at tunable distances and under physiological conditions, by
non-destructively copying pairs of target-bound, DNA-barcoded probes into each new
molecular record. APR thus allows for re-sampling of altered molecular arrangement, a high
signal-to-noise ratio, and the potential for developing single molecule techniques. The
resulting set of proximity pairs represents a rich set of information from which to
reconstruct spatial organization, and allows us to demonstrate a “biochemical nanoscope”
that decodes the complex spatial arrangements of 7 unique APR probes on a homogeneous test
sample. Like other DNA barcoded methods, APR is in principle compatible with massively
parallel sequencing readout. Future development will enable the APR nanoscope to be a
truly single-molecule technique, bridging microscopy's capacity for analyzing individual
nanostructures with the molecular-scale precision and broad applicability of biochemical
techniques.

The  APR  mechanism 

The APR reaction proceeds in an autonomous, cyclic fashion to convert soluble primers into
molecular “records” of probe pair proximity (Fig. 2a). DNA hairpin probes P_i and P_j are
first attached to target molecules T_i and T_j via antibodies, aptamers, direct nucleic
acid hybridization, or other means, and encode the identity of the target type within
unique primer binding sequences (labeled as sequence domains a_i and a_j). In step (i), the
soluble primers (sequences a_i* and a_j*) stably bind their respective hairpin probe type
and are extended by a polymerase through the spacer (s) and palindromic (p) domains up to
the “stopper” site, thereby producing “Half-records” (a_i* - s* - p* and a_j* - s* - p*).
(ii) Half-records are partially displaced by spontaneously re-forming hairpin stems
(s* - p* / p - s) and (iii) bind any nearby Half-records at their 3' palindromic (p*)
segments. (iv) These sequences are again extended, this time over the partner spacer s* and
primer a* segments, effecting the release of soluble, double-stranded “Full-records” that
carry both probe identities as sequences a_i* and a_j. Probes, thus regenerated to their
initial state, may undergo additional cycles in the same or other pairings. Upon
termination of this cyclic recording reaction, a specific Full-record is tested for by
sample PCR amplification with the respective primer pair (a_i, a_j) and subsequent
detection of Full-record copies by gel electrophoresis. In this manner, for example, an
isolated sample with PCR primers for a_i and a_j would yield a positive result in the
presence of T_i-T_j proximity.

Figure 1: Concept. From a cluster of labeled targets (center), Auto-cycling Proximity
Recording (APR, right) generates repeated records of any nearby probe pairs by copying
sequences encoded in the probes. This allows re-sampling of the same targets in different
organization or, in principle, full reconstruction of an individual cluster. In contrast,
current analysis techniques (left) provide limited macromolecular geometry information and
typically destroy associations as they are measured, precluding repeated sampling.


This cycle solves two main challenges in generating multiple records between any nearby
pairs: (1) a template sequence (in this case, s* - p*) must be copied from the probe and
then spontaneously made single-stranded and available for further reaction, and (2) in the
context of probe proximity, pairs of copied sequences must be combined into a single
molecular record and then released.

The first challenge is solved by a modified, DNA-based “Copy-and-Release Hairpin” (CRH)
that enables the isothermal copy and release of an arbitrary sequence onto each primer
strand. The detailed mechanism is as follows (Fig. 2b): (i) DNA primer strand
a* - l_p - b* (i.e., of domains a* and b* with intervening single T nucleotide linker l_p)
binds the 3' end of the hairpin at primer binding domain a. (ii) The primer and hairpin b*
domains compete for short probe domain b. Primer domain b* occasionally binds, despite the
resulting single T nucleotide l_p bulge, and is extended by a displacing polymerase (here,
Bst, New England Biolabs) through stem template domain t. When primer extension reaches a
“stopper” modification near the loop (here, covalent linker triethylene glycol “Spacer 9,”
Integrated DNA Technologies, opposite single T l_t), the polymerase dissociates. (iii) The
new extension and the displaced hairpin stem, being identical in sequence (b* - t*),
rapidly compete for stem template domains t - b via the strand displacement mechanism25
with millisecond transit times.26,27 Computer models (NUPACK28) predict that the stem
template t - b is more likely to be bound to its stem-based complement than to the new
primer extension (iii, plot, as well as SI Note 1), ensuring the Half-record is largely
single-stranded and available for further reaction. In contrast, because sequence a* is
longer than ~11 nt, the Half-record remains stably bound at the primer binding site a until
forcibly displaced by a polymerase.

Figure 2: Auto-cycling Proximity Recording (APR) mechanisms. (a) The APR cycle creates
molecular records of proximity from pairs of hairpin probes. Probe-specific primers are
(i) extended to “Half-records” and (ii) reversibly displaced, (iii) bind palindromic
domains of nearby Half-records, and (iv) are extended on each other to create and displace
“Full-records,” regenerating the probes. See text for detailed description and SI Note 1
for other considerations. (b) Copy-and-Release Hairpin (CRH) detail. Shown is the mechanism
of (i) initial primer binding, (ii) extension, and (iii) random walk of the strand
displacement branch. The hairpin template strand t - b is computationally predicted to pair
predominantly with the hairpin stem complement (iii, inset).

The advance of this copy-and-release mechanism is not in copying a new sequence, of course,
but in autonomously displacing that stably-bound sequence to reveal a single-stranded
product without external thermal or chemical influence. Simply extending a primer on a
single-stranded template, as in PCR, would leave the product stably bound without external
manipulation, and therefore unavailable for further reaction. Here, template strand t - b
is bound to a complementary hairpin competitor strand b* - t*. The free energy released
from dNTP additions, in the presence of a displacing polymerase, allows the otherwise
unfavorable hairpin opening and template copying. Once the polymerase is released at the
stopper, the hairpin stem can close again to displace the new strand.

The details of the CRH probe are important to optimal functioning. The Spacer9 stopper
modification, opposite an unpaired “T” (l_t) to create a small internal loop adjacent to t,
was chosen empirically for its short, ~1 nt length and its ability to consistently stop the
polymerase in this context. Even at this short length, however, the internal loop slows the
ability of the hairpin competitor strand b* - t* to re-bind template strand t - b and
displace new sequence b* - t*. Thermodynamic calculations (NUPACK, see SI Fig. 1) and
experimental evidence (not shown) confirm that this leads to new strand extension b* - t*
preferentially binding template t - b and being unavailable for further reaction. The
unpaired nucleotide within the a* - l_p - b* primer sequence was added to balance this
preference (Fig. 2b, ii), adding an even weaker bulge loop to the other end.29 Furthermore,
to minimize secondary structure and prevent spurious binding and extension of the
single-stranded new sequence b* - t* (and other segments), the primer-binding (b - a) and
template (t) domains utilize only three base types (A, T, and G).

See SI Note 1 for other considerations.

The second challenge, linking these Half-records together in the context of proximity, and
then releasing a single Full-record, is solved by copying a short, 6 nucleotide palindromic
sequence p (AGGCCT) to the 3' end of the Half-record. Here, a palindrome is defined such
that p = p*, enabling any Half-record to bind to any other Half-record by their 3' ends.
Subsequent extension by the polymerase in solution copies the information on each
Half-record onto the other, as well as ultimately displaces and releases the nascent
Full-records from the probes. Note that the non-specific reaction of unbound Half-records
is prevented by two features: the p domain is not part of the soluble primer but only added
upon Half-record formation, and these Half-records are only released once they pair. Both
features limit reactive, p-containing strands free-floating in solution. See SI Fig. 2 for
further probe detail and sequences.
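
The palindrome property and the mutual extension of two Half-records can be illustrated with a short Python sketch. It treats strands as plain 5'-to-3' strings and ignores all kinetics and thermodynamics. Only the palindrome AGGCCT comes from the text; the function names and the two example Half-record sequences are hypothetical placeholders, not the probe sequences used in this work.

```python
# Toy sequence-level model of Full-record formation (illustrative only).
_COMPLEMENT = str.maketrans("ACGT", "TGCA")
PALINDROME = "AGGCCT"  # 6 nt palindromic domain p quoted in the text


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a 5'->3' DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def is_palindromic(seq: str) -> bool:
    """A domain is palindromic (p = p*) if it equals its reverse complement."""
    return seq == reverse_complement(seq)


def full_record(half_i: str, half_j: str, palindrome: str = PALINDROME) -> str:
    """Extend half_i over half_j after the two pair at their 3' palindromes.

    The result is half_i followed by the reverse complement of the part of
    half_j that lies upstream of the shared palindrome.
    """
    if not (half_i.endswith(palindrome) and half_j.endswith(palindrome)):
        raise ValueError("both Half-records must end in the palindrome")
    return half_i + reverse_complement(half_j)[len(palindrome):]


if __name__ == "__main__":
    # Hypothetical Half-records: (primer + spacer copy) followed by p.
    half_i = "GATTGAGTTG" + PALINDROME
    half_j = "TGGATAGTA" + PALINDROME
    top = full_record(half_i, half_j)
    bottom = full_record(half_j, half_i)
    print(is_palindromic(PALINDROME))
    print(top)
    # The two extended strands form a fully complementary duplex.
    print(bottom == reverse_complement(top))
```

Because p equals its own reverse complement, the same function covers any pairing of Half-records, including two copies of the same probe type.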

Proof  of  principle  and  characterization 

Figure 3 presents the proof of principle experiments, demonstrating proximity-based,
auto-cycling, Full-record generation. Fig. 3a focuses on the proximity dependence of
reactions. Lanes 1-4 each contained combinations of 2 probe types, made different only in
spacer length for the purpose of differentiating products on gel electrophoresis, together
with a common Cy5 fluorophore-labeled primer and polymerase (not depicted at top). In lane
1, short and long barcode probes existed unbound in solution. The primers (red, in excess)
were extended to short or long Half-records, respectively, but no Full-records were
generated because probes were not in close proximity of each other. In lane 2, short
barcode probes were held in proximity on streptavidin molecules, and the system produced
short Half-records and, in turn, short Full-records, as expected. When streptavidin was
prepared in separate vessels with either short or long probes, and then mixed together for
reaction (lane 3), both Half-records and Full-records of short and long lengths were
generated. Interaction among probes from different streptavidin molecules would have also
produced Full-records of intermediate length (i.e., of long + short Half-records), but
these were instead only present when short and long probes were randomly arranged on the
same streptavidin molecules, as in lane 4.

Figure 3: Proof of principle experiments. (a) Generation of Full-records requires the
co-localization of probes, here by biotinylated hairpin loops bound to streptavidin,
whereas isolated probes can always generate Half-records. Cropped denaturing PAGE gel
depicting 10 µl reactions (40 min at 37°C) with biotin-streptavidin association and 4:1
overall probe:streptavidin stoichiometry (inset), 8 and 22 nt barcodes (19 and 33 nt stem
lengths copied), 10:1 primer:probe, and 40 nM total probe concentration. A single primer
sequence was used and no secondary amplification was performed. (b) Auto-cycling is also
demonstrated by quantification of Cy5-labeled probes on cropped denaturing PAGE gels.
Reaction and gel conditions identical to those of (a) except: 10 nt spacer (21 nt stem
length copied), 5:1 primer:probe with probes still at 40 nM total. Light green dots are
from gel shown, dark green from repeat experiment. See probe details and sequences in SI
Fig. 2 and full gels in SI Fig. 3.

Figure 3b demonstrates the cycling ability of the probes by observing product formation
over time. Because primers were each labeled with a single copy of Cy5, the amount of
product in each band could be quantified. Beginning with a 5-fold abundance of primers with
respect to probes, probes unbound in solution generated few Full-records of background
signal. When held on streptavidin, however, record quantification reveals that each probe
was read multiple times on average, with an initial record generation rate of one per
7 min. Conclusive proof that a given individual probe sequentially generates records with
multiple different partner probes was provided by massively parallel sequencing (see SI
Fig. 4).

Probe design dictates the “reach” of a probe pair, or probe-probe proximity within which
Full-records can be produced. To characterize the rates of Full-record production as a
function of probe-probe distance, pairs of probes were fixed to programmed positions on 2D
DNA origami structures (Fig. 4a). Three types of probes were tested, encompassing three
probe spacer lengths and two attachment methods. Two types were attached as extensions of
intrinsic origami “staple” strands (Fig. 4a, left inset), while a third was held by a
click-chemistry azide-alkyne linkage to an intermediate strand (right inset) to demonstrate
how probes may be attached to arbitrary moieties. Both methods incorporated single-stranded
DNA as flexible linkers at the base of each probe. Figure 4b diagrams the flexible,
single-stranded and the rigid, double-stranded components contributing to the reach between
two probes.

Recording reactions for each probe and separation distance were carried out separately.
Many copies of a given origami structure were held flat and immobile on a mica surface, as
is done for atomic force microscopy of DNA structures (see SI Note 2 and SI Fig. 7 for
details of how this stabilizes structures). Records were then amplified by PCR, and
products quantified by gel electrophoresis. To account for variation in experimental
conditions, especially in the number of origami, a second probe pair with orthogonal PCR
sequences was present on each origami type, but always at the same fixed separation
distance (Fig. 4a). The rate of production was calculated with respect to this reference
pair.

Figure 4c shows the relative recording rate for three probe designs, containing 0, 12, and
18 nt spacer domains s. Each probe pair was tested every 6 nm for separations of 6 to
48 nm, and mathematical fits to record generation rates were normalized to a common
maximum. Zero, 12, and 18 nt probes all produced records at near maximum rates when pairs
were closest to one another. Rates were reduced to half at 13, 20, and 25 nm, respectively,
and had a maximum reach (1% of maximum recording rate) at 17, 39, and 44 nm. In absolute
terms, the 0 nt spacer probe produced records at the slowest rate, approximately half as
fast as longer probes (not shown).

All three maximum-reach distances correspond well to the values expected when DNA probes
and attached Half-records are oriented optimally, as in a straight chain (Fig. 4b). For the
0 nt spacer probe, the maximum expected distance is approximately twice the sum of the
probe length (19 bp of double-stranded DNA, at 0.34 nm per base pair, ~6.5 nm) and
Half-record length (11 nt of single-stranded DNA minus 3 nt palindrome overlap, at the
maximum 0.58 nm per base,31 ~4.6 nm), totaling ~22 nm. Similarly, the 12 nt and 18 nt
spacer probes have a maximum expected reach of ~36 and ~43 nm, respectively. See SI Note 3
and SI Fig. 8 for estimates of reach profiles using more sophisticated Worm-Like Chain
models.
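
The straight-chain estimate above is simple enough to reproduce numerically. The sketch below uses the per-base lengths and the 0 nt spacer geometry quoted in the text. How the spacer enters the calculation is an assumption made here, namely that each spacer nucleotide lengthens only the single-stranded Half-record; that choice happens to reproduce the quoted ~36 and ~43 nm values but is not stated explicitly in the text. All names are hypothetical.

```python
RISE_DS_NM = 0.34          # nm per base pair, double-stranded DNA (from text)
RISE_SS_NM = 0.58          # maximum nm per base, single-stranded DNA (from text)
PROBE_BP = 19              # double-stranded probe length for the 0 nt spacer probe
HALF_RECORD_NT = 11        # Half-record length for the 0 nt spacer probe
PALINDROME_OVERLAP_NT = 3  # nucleotides lost to the palindrome overlap per side


def max_reach_nm(spacer_nt: int) -> float:
    """Straight-chain maximum reach of a probe pair, in nm.

    Assumption (not from the source): the spacer adds only to the
    single-stranded Half-record length.
    """
    probe = PROBE_BP * RISE_DS_NM
    half_record = (HALF_RECORD_NT + spacer_nt - PALINDROME_OVERLAP_NT) * RISE_SS_NM
    return 2.0 * (probe + half_record)


if __name__ == "__main__":
    for spacer in (0, 12, 18):
        print(f"{spacer:2d} nt spacer: ~{max_reach_nm(spacer):.1f} nm")
```

These are upper bounds for fully extended chains; the Worm-Like Chain treatment referenced in SI Note 3 would give a distribution of reach rather than a single number.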

Figure 4: Probe reach. (a) To characterize reach, relative probe positioning is controlled precisely.
Probes were attached to 2D, rectangular, twist-corrected DNA origami structures measuring ~70 by 100 nm
(origami shown with all possible Fig. 4 probe positions, and equivalent cartoon with 4 probes; see
SI Fig. 5 for geometry, SI Table 1 for staple sequences, and SI Fig. 6 for precise probe positions).
After complete assembly, origami were deposited randomly on mica surfaces in the presence of 12.5 mM
Mg2+ for firm adhesion, a condition we find protects origami from degradation by polymerase (see
SI Note 2). Each probe was held by direct extension of two origami staples (inset, left) or covalent
attachment to an intermediary oligonucleotide (inset, right; azide on probe loop covalently attached
to alkyne on TTT - L intermediary via DIBO-based click chemistry and gel-purified). Separation of
probes was measured at the origami attachment point. Records produced were collected from the
supernatant without disturbing the underlying origami, and PCR-amplified with, for example, a1* and
a2* primers. (b) The maximum "reach" across which a probe pair can make a Full-record is determined
by the cumulative lengths of their components, including double-stranded probe components and
single-stranded Half-record components. These are shown as rigid and flexible, respectively,
corresponding to their persistence lengths of ~50 nm (ref. 30) and ~2 nm. Numbers indicate length in
nucleotides. (c) Otherwise identical probes with spacer lengths of 0 or 12 nt (attached by staple
extension), or 18 nt (covalently attached), were held in pairs separated from 6 to 48 nm by 6 nm
increments (red positions in (a)), recorded (1 hour at room temperature, 100 nM primers), and
log-phase PCR amplified (20 cycles, 500 nM primers) to gel-detectable levels. Denaturing PAGE band
quantification was normalized to a constant reference pair for each well. The entire series for a
given probe pair type was then fit by least squares regression to a simple sigmoidal curve
c1/(1 + exp[c2(dist - c3)]), where dist represents the separation distance and c1-c3 were fit, and
normalized to a maximum rate of 1. See probe sequences in SI Table 2.

Application: Interrogation of complex geometries

Auto-cycling Proximity Recording can be used to interrogate
the relative proximities of a group of targets, enabling
reconstruction of overall geometry (Fig. 5). Using a series of PCR
assays to test if each pairwise combination of probes produced
records (i.e., is within reach, per Fig. 4c), a list of individual
proximities was generated from which to reconstruct the
underlying geometry. The longest probe tested in Fig. 4c, with an
18 nt spacer (s), was first used to determine the relative
proximities of three targets. Three probes, programmed with different
primer binding sequences (a) and therefore generating records
with unique ends (e.g., ends a1* and a2* in Fig. 2a), were again
fixed by 2D origami in a triangular configuration with 30 nm
probe separation (Fig. 5a). This made each probe within the
reach of the other two. A recording was performed, as before,
and the sample was split into 3 volumes, reflecting the number
of pairwise probe combinations. Each subsample was combined
with a different pair of PCR primers, and records, if any, were
amplified. Gel electrophoresis and band quantification indicated
if records were produced, and a "graph" depicting relative
positioning was computationally reconstructed. As expected, the
triangular arrangement yielded all three record types, suggesting
that very geometry. In contrast, when the same probes were
placed in a straight line (Fig. 5b), only two record types were
produced, as the distance between P1 and P2 was beyond probe
reach.

More complex, 7-point geometries were then tested, each
using shorter, 12 nt-spacer (s) probes arranged on a smaller,
24 nm hexagonal pattern (Fig. 5c-e). This made non-adjacent
probes at least 42 nm apart, beyond the reach of these probe
pairs (< 39 nm). Each 7-probe geometry required testing of
21 (6+5+4+3+2+1) possible probe pairings, yielding one of
2^21 > 2 x 10^6 possible binary combinations, any potentially
valid. As in Fig. 5a, records were produced from these origami,
split, and amplified by PCR in the 21 possible primer pairings.


The hexagonal geometry of Fig. 5c yielded 12 of 21 positive
proximity pairs, as seen by gel electrophoresis, in addition to the
21 reference pairs that served as positive PCR controls. Simple
graphing of these 12 pairs (Wolfram Mathematica, "GraphPlot"
function, see SI Note 4) resulted in the correct geometry.
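
The expected count of 12 positive pairs follows from the geometry alone, as the sketch below
shows. It assumes an idealized layout: six probes on a regular hexagon of 24 nm side length plus
one at the center, with a hard 39 nm reach cutoff. The index order is arbitrary and the function
names are hypothetical.

```python
import itertools
import math

SPACING_NM = 24.0  # hexagonal grid spacing from the text
REACH_NM = 39.0    # measured maximum reach of the 12 nt spacer probe pair


def hexagon_with_center(spacing):
    """Six vertices of a regular hexagon (indices 0-5) plus its center (index 6)."""
    pts = [
        (spacing * math.cos(math.radians(60 * k)),
         spacing * math.sin(math.radians(60 * k)))
        for k in range(6)
    ]
    return pts + [(0.0, 0.0)]


def proximity_pairs(points, reach):
    """All index pairs whose separation is below the reach cutoff."""
    pairs = []
    for i, j in itertools.combinations(range(len(points)), 2):
        dx = points[i][0] - points[j][0]
        dy = points[i][1] - points[j][1]
        if math.hypot(dx, dy) < reach:
            pairs.append((i, j))
    return pairs


if __name__ == "__main__":
    pts = hexagon_with_center(SPACING_NM)
    pairs = proximity_pairs(pts, REACH_NM)
    n_tests = math.comb(len(pts), 2)
    print(f"{len(pairs)} of {n_tests} pairs within reach")
    print(f"possible binary outcomes: 2^{n_tests} = {2 ** n_tests}")
```

Adjacent probes sit 24 nm apart, while the nearest non-adjacent pairs sit 24 x sqrt(3), about
41.6 nm, apart. Only the six hexagon edges and the six center-to-vertex pairs therefore fall
below the cutoff.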

The exceptional fidelity of reconstruction in this case is due
to a combination of experimental and reconstruction details.
The graphing algorithm used here is based on a physical model
("spring-electrical") that separates the final probe positions
relatively uniformly, and is therefore particularly congruent with the
regular hexagonal organization of the origami probes. A sense
of the precision of this reconstruction is shown in Fig. 5c, inset.
Given the positions shown for any 6 probes, the 7th probe may
be anywhere within the shaded region and still satisfy the gel
data shown. In addition, a series of graphs with random probe
positions were generated and tested for compatibility with these
same data. One compatible example is shown in the Fig. 5c inset,
lower graph, again with shaded regions showing the precision
of a given probe position. See SI Fig. 10 for more examples.
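
For readers unfamiliar with spring-electrical layouts, the following is a minimal, generic
force-directed sketch. It is not Mathematica's GraphPlot implementation. Every pair of nodes
repels ("electrical" term) and only pairs that produced records attract ("spring" term). The
force laws, cooling schedule, and all parameter values are assumptions chosen for illustration.

```python
import math
import random


def spring_electrical_layout(n, edges, iterations=500, k=1.0, step=0.1, seed=0):
    """Generic force-directed layout: all pairs repel, linked pairs also attract."""
    rng = random.Random(seed)
    pos = [[rng.uniform(-1.0, 1.0), rng.uniform(-1.0, 1.0)] for _ in range(n)]
    linked = {frozenset(e) for e in edges}
    for it in range(iterations):
        force = [[0.0, 0.0] for _ in range(n)]
        for i in range(n):
            for j in range(i + 1, n):
                dx = pos[i][0] - pos[j][0]
                dy = pos[i][1] - pos[j][1]
                d = math.hypot(dx, dy) or 1e-9
                f = k * k / d                    # "electrical" repulsion
                if frozenset((i, j)) in linked:
                    f -= d * d / k               # "spring" attraction
                force[i][0] += f * dx / d
                force[i][1] += f * dy / d
                force[j][0] -= f * dx / d
                force[j][1] -= f * dy / d
        max_move = step * (1.0 - it / iterations)  # simple linear cooling
        for i in range(n):
            mag = math.hypot(force[i][0], force[i][1])
            if mag > 0.0:
                scale = min(mag, max_move) / mag
                pos[i][0] += force[i][0] * scale
                pos[i][1] += force[i][1] * scale
    return pos


def dist(a, b):
    """Euclidean distance between two 2D points."""
    return math.hypot(a[0] - b[0], a[1] - b[1])


if __name__ == "__main__":
    # Proximity list for a hexagon (nodes 0-5) with a center probe (node 6).
    edges = [(k, (k + 1) % 6) for k in range(6)] + [(k, 6) for k in range(6)]
    for idx, (x, y) in enumerate(spring_electrical_layout(7, edges)):
        print(f"probe {idx}: ({x:+.2f}, {y:+.2f})")
```

The result is defined only up to rotation, reflection, and scale, which matches the way
reconstructions are compared with the programmed structures in Fig. 5.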

Two further examples are demonstrated. Figure 5d, where
the center probe was moved to the outside, yielded the 8 correct
proximity pairings and geometry. Figure 5e, where the lower
probe was moved to the outside, yielded the 11 correct
proximities and geometry.

Figure 5: Complex geometry interrogation. (a) The same rectangular origami base used in Fig. 4 was
adapted to hold three probes, of 18 nt spacer design, at 30 nm spacing (see SI Fig. 6). In the same
conditions as in Fig. 4, probes were first placed in a triangular arrangement and recording was
carried out. The sample was split, records PCR-amplified with separate pairs of probe primers (and
reference primers), and products separated by gel electrophoresis and quantified (green plot
profiles, marked (+) when judged to qualify for the proximity list). As expected, all three probe
pairs yielded records, suggesting a triangular shape in reconstruction. (b) When those same probes
were attached in a line, with P1 and P2 out of reach from one another, only two record types were
amplified. This suggests a relatively linear arrangement. (c) More complex, 7-probe geometries were
also interrogated, this time with 24 nm hexagonal grid spacing and 12 nt spacer probes. The full set
of 21 primer pair amplifications yielded the gel shown in the center, with 12 pairs in proximity.
Computational reconstruction yielded the hexagonal pattern on the right. Inset shows positional
precision of any single point, with respect to others shown, as shaded region that satisfies this
gel. Top shows precision for computed hex pattern, bottom shows precision for random pattern that
also satisfies gel result. (d) When center probe P7 was moved outside the hexagon, APR yielded the 8
expected proximities and correct reconstruction. (e) Similarly, when P1 was moved, the 11 expected
proximities and correct reconstruction resulted. Throughout, computational reconstruction images
were reflected and rotated, but not distorted, for best comparison to the programmed structure
diagrams. See probe sequences in SI Table 2 and full gels in SI Fig. 9.

Note that the relative quantity of Full-records created should
not be taken as a measure of relative distance between probes,
but only that the appearance of records should be taken to
indicate probes within reach (proximity); the non-linear record
generation rate as a function of distance (Fig. 4c), variable probe
attachment to target, and possible sequence-dependence make
distance approximation unreliable from this simple measure.
Instead, further refinement of target-target distances may be
performed by sequential experiments with probes of different reach.
For example, a pair of 12 nt-spacer probes may produce records,
indicating targets closer than 36 nm, while a pair of 0 nt-spacer
probes at those same positions do not, indicating those same
targets are also farther than 18 nm. Alternatively, a single
experiment with a mixture of probes with unique primers and variable
reach could be used. See SI Fig. 11 for examples of how
reconstruction precision could be further refined with this strategy.
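
The bracketing logic in the example above can be written as a small helper. This is a sketch
under the simplifying assumption that each probe type has a sharp reach cutoff. In practice the
recording rate falls off gradually with distance (Fig. 4c). The function name and input format
are hypothetical.

```python
def bracket_distance(observations):
    """Bound a target-target distance from probes of different reach.

    observations: iterable of (reach_nm, records_seen) for one target pair.
    Returns (lower, upper) bounds in nm; None means unbounded on that side.
    """
    lower, upper = None, None
    for reach_nm, records_seen in observations:
        if records_seen:
            # Records appeared, so the targets lie within this probe's reach.
            upper = reach_nm if upper is None else min(upper, reach_nm)
        else:
            # No records, so the targets lie beyond this probe's reach.
            lower = reach_nm if lower is None else max(lower, reach_nm)
    return lower, upper


if __name__ == "__main__":
    # 12 nt-spacer probes record (reach 36 nm); 0 nt-spacer probes do not (18 nm).
    print(bracket_distance([(36, True), (18, False)]))
```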


Application:  Re-sampling  of  a  dynamic  system 

In addition to the successful interrogation of complex
geometries, we demonstrate the advantages of non-destructive
recording by repeatedly sampling the same probes in a changing
system. The same origami-based, triangular arrangement of probes
of Fig. 5a-b was constructed, with a mechanism for inactivating
probe P3 (Fig. 6a). At each sampling point, the supernatant
was split, PCR-amplified, and observed by gel electrophoresis.
First, the triangular arrangement was recorded (with primer
u* on P3) and analyzed, indicating the co-localization of three
probes as expected (Fig. 6b, i). After a wash with buffer, the
supernatant showed no residual records (Fig. 6b, ii). Then, by
applying an inextensible (inverted dT at 3' end to prevent
extension) P3 primer blocker z* - u* - y* to the system, any leftover
P3 primers and Half-records were displaced and probe P3 was
made unable to bind primer, rendering it deactivated. Recording
then indicated co-localization of only P1 and P2 (Fig. 6b, iii).
To sample a third state, the same origami were again washed
and shown record-free (Fig. 6b, iv). Probes P3 were then
re-activated by addition of strand y - u - c, which stripped off the
primer blockers. Recording and analysis once again indicated
the full triangular arrangement (Fig. 6b, v), completing three
correct samplings of the same targets.

Figure 6: State change. (a) The same origami probe triangle was used
as in Fig. 5a(i), but probe P3 could be de-activated (see SI Fig. 6 for
precise probe positions). (b) PCR amplification and denaturing PAGE
of recording from active probe (i), after intermediate wash (ii), from
de-activated probe (iii), after second intermediate wash (iv), and from
re-activated probe (v). The intermediate wash results demonstrate that
there were no leftover records. A single set of origami was used for
all steps and re-sampled. Origami and PAGE otherwise treated as in
Fig. 5. See probe sequences in SI Table 2 and full gels in SI Fig. 12.

Discussion 

Given that structure dictates function in synthetic and natural
nanostructures, we sought to record nanoscale proximity and
organization in as precise and complete a manner as possible.
To this end, we have developed a biochemical technique termed
Auto-cycling Proximity Recording (APR). The two key mechanisms
requiring development were a non-destructive, continuous
(i.e., "auto-cycling") method for copying DNA strands from
probes, and a method for pairing these strands only in the
presence of probe proximity ("proximity recording"). The first was
solved by the development of the Copy-and-Release Hairpin
(CRH), which allows a probe-specific primer to be extended by
a short template, and then isothermally released for further
reaction. The second, proximity requirement was enforced by
encoding a short palindromic site at the 3' end of the extension.
The extended primers represent "Half-records" that must pair
by their palindromes with other nearby Half-records before they
can extend over each other, through their primer domains, and
be released into solution as "Full-records" of probe pair
proximity. Subsequent identification of Full-record sequences reveals
proximity or putative interactions.


The APR method shares a superficial resemblance to Proximity
Ligation (PLA) and Extension (PEA) techniques in that,
fundamentally, it records pairwise interactions in DNA. A key
difference is that, because the probes are not consumed in
detecting each proximity, many pairwise recordings can be made
from the same probe. This critical factor allows for not only
better signal-to-noise ratios, but also re-sampling of dynamic
systems, and ultimately the interrogation of true single-molecule
networks.

We have taken advantage of the performance of APR to
demonstrate the detection and reconstruction of complex
geometries and the re-sampling of dynamic interactions among the
same targets, all with simple PCR and gel-based readouts.
Geometry reconstructions put 7 probes in the correct relative
position, a 1-in-2 x 10^6 event by chance alone, and reconstruction
precision is tunable by using probes of different length.
Re-sampling allowed the same probes to be recorded 3 times in
different molecular-scale states, a task impossible with most
assays.

Several simple modifications to probe design are apparent.
Probes can be attached to targeting moieties in a variety of
ways, and spacer length within the probe allows for reach to
be tuned. Within the probe stem sequence, the choice of a
palindrome allows any probe to potentially produce Full-records with
any other probe. If target interactions are simple heterodimers,
however, stronger and more specific Half-record binding can
be designed by using longer, probe-specific, non-palindromic
sequences. Cycling rates can perhaps be increased with these
longer, non-palindromic 3' sequences, by keeping probe spacers
short, or by using alternative means of biasing the Half-record
strand to remain single-stranded and available for pairing.
Finally, a variety of stopper modifications can be used for optimal
performance under different conditions.

An assumption in describing the geometry of a complex
detected in bulk is that all structures are the same or similar. In
general, detection in bulk cannot differentiate between a single
structure and several overlapping pieces in the system.
Beyond bulk geometry and re-sampling, however, the APR method
suggests the exciting prospect of single-molecule recording,
wherein globally unique barcodes, corresponding to "names" of
each individual target (rather than target species), are encoded in
the Full-records. Massively parallel Next Generation Sequencing
could then be used as a readout to describe every interaction
on a per-molecule basis, allowing for protein counting,
interaction stoichiometry and turnover, real network connectivity (as
opposed to averaged or inferred connectivity), and other
measures not possible with current techniques. We are actively
pursuing these single-molecule capabilities.

Materials and methods

Probes  Probes were ordered from Integrated DNA Technologies with possible
modifications of internal biotin (/iBiodT/), internal azide (/iAzideN/), or internal
"Spacer 9" (/iSp9/). See SI Fig. 2 and SI Tables 1 and 2 for probe sequences.
Strands were ordered at 100 nmol or 250 nmol scale, with HPLC purification.

Some probes were ordered with an internal biotin modification in the center
of the 5 nt loop domain, in which case probes were held in proximity with
streptavidin, as indicated in Fig. 3 and SI Fig. 2. Probes were attached to streptavidin
(New England Biolabs (NEB)) in a 20 µl reaction containing 25 nM streptavidin
and 140 nM probe (for a total of more than 4:1
probe:streptavidin to ensure streptavidin saturation), held at 37°C for 1 hour.

Other probes were covalently linked to alkyne-modified strands by click
chemistry. Azide modified raw probes were ordered directly from IDT and
dibenzocyclooctyne (DBCO) group modified linker strands were purchased
from Boston Open Labs. Both DNA strands were HPLC purified by the
manufacturers. Typically, 150 pmol of azide modified probes first went
through Bst-mediated extension and USER-mediated cleavage (for random
barcodes/spacers), and were then column purified using the QIAquick PCR Purification
Kit (Qiagen). A volume of 9 µl of 1 mM DBCO-linker strands and 1 µl of 10x
PBS (pH 7.4) buffer were added to 20 µl of the eluted azide modified probes
and incubated at room temperature for 12 h. The linked probes were purified by
8% denaturing polyacrylamide gel and quantified by NanoDrop UV-Vis
Spectrophotometer (Thermo Scientific).

NUPACK simulations  Expected reaction products at thermodynamic equilibrium
were predicted using NUPACK web-based software (nupack.org), "analysis"
option. Specified options were "DNA" nucleic acid type, 37°C temperature,
10 nM reactants, with maximum complex size of two. Advanced options
were 0.05 mM Na, 0.002 mM Mg, and "all" dangle treatments. Pair probability results
were downloaded in text form and re-plotted using custom software (Wolfram
Mathematica). See SI Fig. 1.

APR recording  Recording in solution was carried out as follows: Probe strands
were bound to streptavidin, as above. A mixture of 100 µM each dNTP (NEB),
1x ThermoPol Buffer (NEB), 2x Bst large fragment polymerase (NEB),
200-400 nM Cy5-labeled primer (IDT), and 40 nM probe streptavidin was
prepared at 4°C at a volume of 10 µl per reaction. The temperature was raised to
37°C for 120 min (or other reaction time), followed by 80°C heat inactivation
for 20 min.

For recording reactions on origami, flow chambers were created by sticking
a freshly cleaved mica (Ted Pella) piece to a channel slide system (ibidi,
sticky-Slide VI 0.4, Cat. No. 80608), yielding a 30 µl chamber volume. After washing
the chamber three times with 60 µl of TAE/Mg buffer by adding the buffer to
one reservoir (inlet) and simultaneously taking out the same volume from the
other side (outlet), a 30 µl solution containing origami at 50 pM concentration
was introduced to the chamber. (In the first washing round, ~30 µl of buffer
occupies the chamber and only ~30 µl of extra buffer comes out.) After 30 min
of incubation, unbound origami and extra staple strands were removed by
washing three times with 60 µl of fresh TAE/Mg buffer and three times with 60 µl
of 1x ThermoPol buffer (NEB, Cat. No. B9004S, 10x; 1x contains 20 mM
Tris-HCl, 10 mM (NH4)2SO4, 10 mM KCl, 2 mM MgSO4, 0.1% Triton X-100).
To the chamber, a 40 µl solution containing 8 units/µl of Bst polymerase (NEB,
Cat. No. M0275S), 100 µM dNTP (NEB, Cat. No. N0447S), and the relevant
primer mixes typically at 100 nM each, in 1x ThermoPol reaction buffer, was
added and incubated for 1 hr at room temperature. After the recording
reaction, the supernatant solution containing produced records was collected and the
polymerase was heat inactivated by incubating the solution at 80°C for 20 min.
For samples used in the geometry studies (three point and hexagonal grid
patterns) and the state change study, before quenching the reaction, extra recording
primers contained in the product solution were removed by mixing the solution
with Exonuclease I (NEB, Cat. No. M0293S) at 9:1 volume ratio and
incubating 20 min at 37°C. This minimizes any false-positive PCR amplification by the
recording primers.

State change recordings of Fig. 6 were carried out following the standard
origami procedure described just above, except that the product solutions were
collected after first adding 40 µl of buffer into the inlet, to prevent drying of the
chamber between rounds. After the first and second state recording rounds, the
chamber was washed six times with 1x ThermoPol buffer before the "wash"
samples were collected, followed by three times with TAE/Mg. For
deactivating or reactivating the probe P3, a 40 µl solution containing the de-activator
strand at 10 µM or the re-activator strand at 0.1 µM, respectively, in TAE/Mg
was added and incubated for 30 min at room temperature, after which the
chamber was washed four times with TAE/Mg and four times with 1x ThermoPol
buffer before each recording reaction.


For the 7 point hexagonal-grid geometry recordings of Fig. 5, the experimental
conditions were similar to those for state change recording, except that the
annealing temperature for PCR amplification was set to 52°C, with 200 nM of each
primer sequence used in the PCR mixture.

Electrophoresis  Gels (8x8 cm) for denaturing PAGE were cast in house at 8%
acrylamide (J.T. Baker), with 7 M urea (Sigma) and 1x TAE (40 mM Tris, 20
mM acetic acid, 1 mM EDTA) buffer (GrowCells.com), in plastic cassettes (Life
Technologies), and run for 30-35 min at 200 V and 65°C in 1x TAE buffer. Gels
were then removed from cassettes, stained in 1x SybrGold (Life Technologies)
for 15 min, and imaged with a Typhoon scanner (General Electric).

DNA Origami preparation  The design and folding protocols of the 2D
rectangular origami were adapted from the original rectangle and the twist-corrected
version.10-33 Single-stranded M13mp18 DNA (scaffold strand) was purchased
from Bayou Biolabs (Catalogue #P-107) and staple strands were obtained from
Integrated DNA Technologies. Staple strands were purchased and used
unpurified, except the staples with probe extension at the 5' end of the staples,
which were purchased PAGE-purified. Scaffold and staples were mixed together
at target concentrations of 5 nM and 40 nM, respectively, in TAE (40 mM Tris
acetate, 1 mM EDTA) buffer with 12.5 mM magnesium acetate (TAE/Mg). For
origami  with  click-coupled  probes,  the  probe  strands  were  added  at  l  2-fold  the 
concentration  of  the  staple  strands  containing  the  anchor.  For  origami  folding, 
the mixtures were kept at 90°C for 5 min and annealed from 90°C to 60°C over
the course of 30 min, then from 60°C to 45°C over the course of 90 min, and
finally from 45°C to 25°C over the course of 20 min. Folded origami solutions
were used without further purification except where otherwise stated. See SI
Fig. 5 for geometry and layout and SI Table 1 for staple sequences.
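
The folding program above can be summarized as a small schedule table. The sketch below encodes
the stated stages and derives cooling rates and total run time. It assumes each ramp is linear in
time, which the text does not specify, and all names are illustrative.

```python
# (start_C, end_C, minutes) for each stage, as stated in the text.
ANNEAL_STAGES = [(90, 90, 5), (90, 60, 30), (60, 45, 90), (45, 25, 20)]


def ramp_rates(stages):
    """Cooling rate of each stage in degrees C per minute."""
    return [(start - end) / minutes for start, end, minutes in stages]


def total_minutes(stages):
    """Total programmed duration in minutes."""
    return sum(minutes for _, _, minutes in stages)


def temperature_at(stages, t_min):
    """Temperature at time t_min, assuming each ramp is linear in time."""
    elapsed = 0.0
    for start, end, minutes in stages:
        if t_min <= elapsed + minutes:
            frac = (t_min - elapsed) / minutes
            return start + (end - start) * frac
        elapsed += minutes
    return float(stages[-1][1])  # hold at the final temperature afterwards


if __name__ == "__main__":
    print("rates (C/min):", [round(r, 3) for r in ramp_rates(ANNEAL_STAGES)])
    print("total time (min):", total_minutes(ANNEAL_STAGES))
```

The slowest stage, 60°C to 45°C, cools at roughly one sixth of a degree per minute.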

Record PCR amplification  PCR amplification utilized 1x VentR (exo-)
polymerase in 1x ThermoPol Buffer (New England Biolabs (NEB)), for 20 cycles,
in 20 µl reactions. Primers were initially present at 0.5 µM, and dNTP mix
(NEB) at 100 µM each. Temperature cycling included a 95°C melt for 2 min,
20 cycles (varies) of 95°C for 20 s denaturing, 56°C for 45 s annealing, and
72°C for 15 s extension, and a final 72°C extension for 3 min.

Abstract (154 words):

Collections of DNA sequences can be rationally designed to self-assemble into predictable
three-dimensional structures. The geometric and functional diversity of DNA nanostructures created to date
has been enhanced by improvements in DNA synthesis and computational design. However, existing
methods for structure characterization typically image the final product or laboriously determine the
presence of individual, labeled strands using gel electrophoresis. Here, we introduce a new method of
structure characterization for simultaneous quantification of every strand species incorporated into a
structure. Our method uses barcode extension and next-generation DNA sequencing to quantitatively
measure the relative incorporation of individual strands into a DNA nanostructure. By quantifying the
relative abundances of distinct DNA species in product and monomer bands, we can study the influence
of geometry and sequence on assembly. We have tested our method using 2D and 3D DNA brick and
DNA origami structures. Our method is general and should be extensible to a wide variety of synthetic
DNA structures.


Introduction 


The discovery in 1982 that DNA can self-assemble into designed structures initiated the field of
structural DNA nanotechnology1. Over the past few decades, the field of structural DNA nanotechnology
has produced a stunning array of two- and three-dimensional structures2-13. These structures have been
used for a variety of applications, such as protein structure determination14, enzyme scaffolding15-17,
photonics, and drug delivery. The standard workflow is typically: structures are designed on a
computer,  component  oligonucleotides  are  ordered  and  synthesized  commercially,  the  structures  are 
assembled  in  the  lab,  and  then  characterized  using  imaging  or  other  analytical  methods,  including  gel 
electrophoresis.  This  design-build-test  process  can  be  iterated  several  times  if  necessary  to  achieve  a 
design  with  high  performance. 

Most  aspects  of  the  design-build-test  cycle  have  been  dramatically  improved  over  the  past  few  decades. 
Structure  design  is  much  easier  now  than  it  was  in  1982,  as  evidenced  by  new  design  paradigms  (DNA 
origami3, DNA bricks8, gridiron6, 3D polyhedral meshes10) and software packages for automating
structure design and analysis (NUPACK24, caDNAno25, CanDo26). The synthesis of the oligonucleotides
that form the staples or tiles for a structure is growing exponentially cheaper. Structure assembly is also
faster  and  easier  than  ever  before,  with  the  recent  demonstration  of  isothermal  assembly  protocols  for 
DNA  origami  and  DNA  brick  structures27,28.  As  a  result  of  these  combined  advances,  one  can  now  design 
and  assemble  multiple  structure  designs  in  a  single  round  of  testing. 

Despite the many advances in structure design, synthesis, and assembly, structure imaging remains
time-consuming and requires dedicated, expensive machines. In recent years, new technologies such as
fast-scan atomic force microscopy (fast AFM) and cryo-electron microscopy (cryo-EM)29,30 have
increased the speed and resolution with which DNA nanostructures can be imaged, but still require
substantial equipment investments and expertise to use to their fullest extent. Super-resolution optical
microscopy techniques such as DNA-PAINT31 have proven very helpful for imaging structures with
multiple orthogonal labels32 in 3 dimensions33, but they require labeling structures with organic dyes or
single-stranded extensions. In spite of these advances, it remains difficult to characterize the component
composition of multidimensional DNA nanostructures in a high-throughput, label-free fashion.

In  addition  to  imaging  methods,  several  methods  based  on  gel  electrophoresis  can  be  used  to  analyze 
DNA structure assembly. These methods compare the amount of material present in monomer, product,
and  aggregate  bands,  and  measure  structure-wide  average  quality  or  the  site-specific  incorporation  of 
labeled  oligonucleotides.  The  simplest  such  label  is  a  fluorescent  intercalating  dye  (e.g.,  Sybr  Safe),  but 
de-Bruijn  probes  can  provide  more  quantitative  estimates  of  the  average  structure  quality34.  In  some 
cases,  one  cares  more  about  the  local  assembly  of  particular  structural  features,  rather  than  overall 
structure  quality.  In  these  cases,  fluorescently-labeled  oligonucleotides  are  typically  employed  to 
measure  site-specific  incorporation35,  or  FRET  is  used  to  measure  the  co-localization  of  two  structure 
components17,36. These methods are generally simpler to employ than imaging, but they do not provide
detailed  information  about  the  component  composition  of  a  structure  with  single  component  strand 
resolution.  Thus,  measuring  the  incorporation  efficiency  of  all  of  the  components  of  a  DNA  structure 
remains  challenging. 

A candidate method that could provide detailed quantitative information about the component
composition of DNA nanostructures is next-generation DNA sequencing (NGS). Since each component
strand in a uniquely addressable DNA nanostructure has a unique DNA sequence, it should be possible
to obtain information about the composition of an entire structure with single tile or staple resolution.
NGS thus has higher multiplexing capabilities than site labeling methods, which typically are limited to
labeling a few component oligonucleotides at a time. Unlike imaging methods, NGS allows for many
samples to be processed in parallel using sequence barcodes, thereby increasing the throughput of the
method substantially. Also, since the sequencing data are collected as an unbiased class average of
many individual structures, they provide a rich picture of the statistics of self-assembly. Furthermore,
the cost of NGS has dropped exponentially over the last few years, making this an increasingly attractive
and affordable analysis technique. NGS has been used by biologists to measure RNA expression levels37,
ribosome activity38, transcription elongation39, and protein-DNA interactions40, thus it should be possible
to apply the method to study the self-assembly of DNA nanostructures in a quantitative fashion.

Here,  we  introduce  a  method  for  studying  the  assembly  of  DNA  nanostructures  that  uses  next- 
generation  sequencing  to  quantify  the  relative  incorporation  of  staples  or  tiles.  The  method  works  by 
assembling  structures,  segregating  and  isolating  products  and  unincorporated  strands  using  gel 
electrophoresis  and  ligating  barcoded  adapter  sequences  to  the  ends  of  denatured  staples  or  tiles.  Once 
the  ligations  are  complete,  the  resulting  libraries  can  be  purified,  amplified,  and  analyzed  using  NGS.  We 
call  our  method  barcode  extension  for  analysis  and  reconstruction  of  structures,  or  BEARS.  Using  BEARS, 
we  demonstrate  the  simultaneous  quantification  of  each  of  the  components  of  2D  and  3D  DNA  brick 
and  DNA  origami  nanostructures.  Thus,  BEARS  can  be  used  to  study  the  assembly  of  a  wide  range  of  self- 
assembled  DNA  nanostructures. 

Results 


Developing  a  barcode  ligation  strategy 


The BEARS protocol is divided into five basic parts: structure assembly, product and monomer
purification, denaturation and barcode ligation, sample pooling, and next-generation sequencing (Fig.
1a). Structure assembly, product purification, and next-generation sequencing involve standard
procedures, but the barcode ligation step presents a unique challenge (Supplementary Fig. 1). Ideally,
one would like to attach barcodes in an efficient, unbiased, and sequence-independent manner. To do
this, we developed a barcode ligation strategy that uses single-stranded DNA ligation. First, a 5'
phosphorylated adapter with a 3' dideoxycytosine modification is ligated to the 3' end of a tile or staple
using T4 RNA ligase 1. The ligation product is then purified on a PAGE gel (Supplementary Fig. 2),
phosphorylated, and a barcoded adapter is added to the 5' phosphate using T4 RNA ligase 1. This doubly
ligated product is column-purified and amplified using PCR. The resulting samples (isolated from the
product band, the monomer band, or the input oligonucleotide mix for each structure design) are
pooled to enhance the throughput of the method, and then sequenced using NGS. Although BEARS
requires  structure  purification,  this  is  not  a  limitation  because  structures  are  denatured  immediately 
after  purification.  Thus,  we  do  not  expect  purification  to  alter  the  component  composition  of  the 
structures  analyzed  using  BEARS.  For  further  details  regarding  sequencing  library  preparation,  see  the 
Methods  section  and  Supplementary  Figures  1  and  2. 

Sequencing  data  analysis 

NGS  sequencing  data  files  are  scanned  to  look  for  reads  corresponding  to  strands  present  in  a  DNA 
origami  design  or  DNA  brick  canvas  (Supplementary  Figure  1).  This  yields  a  list  of  read  counts  for  each 
strand  (for  a  histogram  of  the  read  counts,  see  Supplementary  Figure  3).  Read  fractions  are  computed 
from  the  read  count  lists  by  dividing  the  read  count  for  each  strand  by  the  sum  of  all  of  the  read  counts 
in a sample. Next, we determine which strands are present in the structure by taking the ratio of the
read fraction in the product band to the read fraction in the entire canvas or origami design. This yields
either a unimodal or bimodal lognormal distribution, with the rightmost peak corresponding to well-
incorporated strands and the leftmost peak corresponding to strands that were not present in the
structure (Supplementary Figure 4). If one peak is present, then all strands are well-incorporated, which
is  typically  the  case  with  DNA  origami  structures.  When  necessary,  we  apply  a  threshold  to  separate 
these  two  groups  of  strands.  Further  analysis  is  performed  on  the  strands  that  passed  the  threshold,  as 
described  below. 
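
The read-fraction and thresholding steps described above can be sketched in Python as follows. This is a minimal illustration and not the analysis code used in this work: the function names, strand identifiers, read counts, and the fixed log-ratio threshold are all hypothetical, and we assume here that the "entire canvas" sample is the input oligonucleotide mix. In practice the threshold would be chosen from the fitted distribution of ratios rather than fixed by hand.

```python
import math

def read_fractions(counts):
    """Divide each strand's read count by the total read count in the sample."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sample contains no reads")
    return {strand: n / total for strand, n in counts.items()}

def log_ratios(product_counts, input_counts):
    """log10 of (product read fraction / input read fraction) per strand.

    Strands with zero reads in either sample are skipped because the
    log-ratio is undefined for them.
    """
    rf_product = read_fractions(product_counts)
    rf_input = read_fractions(input_counts)
    ratios = {}
    for strand, rf_in in rf_input.items():
        rf_prod = rf_product.get(strand, 0.0)
        if rf_in > 0 and rf_prod > 0:
            ratios[strand] = math.log10(rf_prod / rf_in)
    return ratios

def present_strands(ratios, log_threshold):
    """Return the strands whose log-ratio lies above the assumed threshold."""
    return {s for s, r in ratios.items() if r > log_threshold}

# Hypothetical counts: s1-s3 belong to the shape, s4 is a canvas strand that does not.
product = {"s1": 900, "s2": 1100, "s3": 990, "s4": 10}
canvas = {"s1": 1000, "s2": 1000, "s3": 1000, "s4": 1000}

ratios = log_ratios(product, canvas)
kept = present_strands(ratios, log_threshold=-1.0)  # threshold value is illustrative
```

Working on the log scale reflects the lognormal shape of the ratio distribution noted above: the two peaks become roughly symmetric, so a single cut between them separates the two groups of strands.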

To  quantify  the  relative  incorporation  of  strands  into  a  DNA  structure,  one  needs  a  metric  that  accounts 
for  ligation  and  NGS  sequence  bias.  There  is  some  sequence  bias  inherent  to  NGS,  as  we  obtain  a 
distribution  of  read  counts  even  for  oligonucleotides  mixed  at  equal  stoichiometry  (Supplementary 
Figure  3).  This  is  consistent  with  observations  in  other  NGS  library  preparation  methods  that  certain 
library members tend to be over- or under-represented based on their sequence41. Fortunately, the
sequence bias of NGS tends to be consistent, owing to sequence-specific increases in ligation or
amplification efficiency, as indicated by our observations (data not shown) and those in the literature41,42.
Thus, by analyzing the data appropriately, one should be able to account for this bias.

To  account  for  the  sequence  bias  of  NGS,  we  introduce  a  new  metric  for  measuring  the  relative 
incorporation  of  strands  into  a  DNA  structure  called  structure-wide  relative  incorporability,  or  SRI.  SRI  is 
calculated  by  dividing  the  read  fraction  of  each  strand  in  the  product  by  the  sum  of  its  read  fractions  in 
the  product  and  monomer  bands.  As  a  result,  the  SRI  varies  from  0  (no  incorporation)  to  1  (full 
incorporation,  no  reads  in  the  monomer),  with  higher  SRI  values  indicating  better  incorporation  of  a 
strand relative to the other strands in a structure. SRI makes two key assumptions: that the variation in
stoichiometry in the input assembly reaction is lower than the variation in incorporation into structures,
and that the sum of the product and monomer bands is representative of the entire assembly reaction. It is
important to emphasize that SRI measurements are relative and not absolute because they involve the
ratios of read fractions between pairs of samples (i.e., the product band and the monomer band), which
are effectively ratios of ratios. Thus, SRI is a nonlinear metric for strand incorporation and should not be
treated as a linear one. For the same reason, SRI is not equivalent to the bulk yield of a structure. For
further details on SRI calculation, see the Methods section.
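
As a concrete illustration of the definition above, the SRI of a strand is its read fraction in the product band divided by the sum of its read fractions in the product and monomer bands. The following Python sketch computes this quantity; the function names and the read counts are hypothetical, and the handling of strands with no reads in either band (they are simply omitted) is our assumption rather than something stated in the text.

```python
def read_fractions(counts):
    """Read count of each strand divided by the total reads in the sample."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sample contains no reads")
    return {strand: n / total for strand, n in counts.items()}

def sri(product_counts, monomer_counts):
    """SRI = RF_product / (RF_product + RF_monomer), computed per strand."""
    rf_product = read_fractions(product_counts)
    rf_monomer = read_fractions(monomer_counts)
    values = {}
    for strand in set(rf_product) | set(rf_monomer):
        p = rf_product.get(strand, 0.0)
        m = rf_monomer.get(strand, 0.0)
        if p + m > 0:  # strands never observed are omitted (assumption)
            values[strand] = p / (p + m)
    return values

# Hypothetical read counts for three strands in the two gel bands.
product = {"a": 500, "b": 400, "c": 100}
monomer = {"a": 100, "b": 400, "c": 500}
values = sri(product, monomer)
```

In this example strand "b" has the same read fraction in both bands and receives an SRI of 0.5, while "a" is enriched in the product band and "c" in the monomer band. Because each read fraction is itself normalized within its own sample, the values rank strands relative to one another and do not measure absolute yield.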

Validating  BEARS 

BEARS  is  a  reproducible  and  quantitative  method  for  studying  structure  assembly.  We  assembled  a  2D 
DNA  brick  structure  on  two  separate  occasions  and  prepared  sequencing  libraries  from  these  replicate 
assemblies  on  separate  days.  We  calculated  SRI  maps  for  each  of  these  replicate  assemblies  using  BEARS 
(Fig. 2a). We also directly compared the SRI for each brick strand in replicate structure assemblies using a
scatterplot (Fig. 2b). The SRI values for each brick strand are highly correlated between replicate
assemblies (correlation coefficient = 0.925). This suggests that there is relatively little variation
introduced to the incorporation data due to structure assembly, gel purification, library preparation, and
Illumina sequencing. It is likely that these variables would influence the monomer and product bands
equally, so by taking the ratio of these samples we can control for these sources of variation.
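
The replicate comparison can be sketched as a correlation between per-strand SRI values from two assemblies. The text does not specify which correlation coefficient was used, so the Pearson coefficient below is an assumption; the function names and the SRI values are hypothetical.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)

def replicate_correlation(sri_rep1, sri_rep2):
    """Correlate SRI values over the strands measured in both replicates."""
    shared = sorted(set(sri_rep1) & set(sri_rep2))
    if len(shared) < 2:
        raise ValueError("need at least two shared strands")
    return pearson([sri_rep1[s] for s in shared],
                   [sri_rep2[s] for s in shared])

# Hypothetical SRI values for four brick strands in two replicate assemblies.
rep1 = {"b1": 0.90, "b2": 0.75, "b3": 0.40, "b4": 0.10}
rep2 = {"b1": 0.85, "b2": 0.80, "b3": 0.35, "b4": 0.15}
r = replicate_correlation(rep1, rep2)
```

Restricting the comparison to strands detected in both replicates avoids treating undetected strands as zeros, which would otherwise inflate or deflate the coefficient depending on sequencing depth.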

In addition to reproducibility, we also tested the quantitative performance of Illumina sequencing by
mixing pools of pre-extended oligonucleotides with varying stoichiometries (Fig. 2c). This allowed us to
directly amplify and sequence the mixes without any ligation, thereby testing the effects of amplification
and NGS on quantification. We counted the number of reads for each pool of oligonucleotides in each
mix, and calculated the read fraction for each pool by dividing the read counts for each pool by the total
number of reads in the mix. We then compared the observed read fraction for each pool with the
expected read fraction based on the stoichiometry in the mix, which was set to 0%, 10%, 20%, 30%, or
40% based on the pipetted volume (Fig. 2d, see Methods section for details). We found that the
expected and observed read fractions were highly correlated (R² = 0.9995), with a relatively low amount
of variation between mixes (error bars indicate one standard deviation). Based on the sizes of the error
bars, we can detect 1.5-fold changes in stoichiometry with high reliability.

Ligation  bias  does  not  substantially  alter  quantification  by  sequencing.  To  test  the  effect  of  ligation  bias 
on  the  quantitative  performance  of  BEARS,  we  mixed  pools  of  non-extended  oligos  with  varying 
stoichiometry,  and  prepared  libraries  for  sequencing  using  BEARS  (see  Methods  section  for  details).  This 
allowed us to test the combined effects of ligation, amplification, and NGS on quantification. As in Fig.
2d, we calculated read fractions for each pool, and compared the observed read fraction for each pool
with the expected read fraction based on the stoichiometry in the mix (Fig. 2e). Although the correlation
is less strong (R² = 0.9921) than in the pre-extended case and the error bars are larger, we can still
robustly  detect  2-fold  changes  in  oligo  stoichiometry  using  BEARS.  This  indicates  that  ligation  bias  does 
add  some  variation  to  the  read  fractions  measured  using  BEARS. 

Using  BEARS  to  analyze  2D  DNA  brick  structures 

We  can  reconstruct  structures  from  a  2D  molecular  canvas  with  high  accuracy  using  BEARS.  We 
assembled  5  arbitrarily  chosen  letters  from  a  2D  molecular  canvas8  (the  canvas  consists  of  "pixels"  that 
are  individual  DNA  bricks  and  can  be  modularly  combined  to  form  a  variety  of  shapes)  and  one  newly- 
designed  bear  shape  using  a  subset  of  the  DNA  bricks  from  the  same  molecular  canvas,  and  prepared 
sequencing  libraries  from  both  product  and  monomer  bands  using  BEARS  (Fig.  3).  Each  panel  shows  a 
design  schematic,  4  representative  AFM  images  of  fully  assembled  structures,  thresholded  images  based 
on  Gaussian  fitting  of  read  fraction  ratios,  and  renderings  of  the  SRI  data  (see  Methods  section  for 
details) for that structure generated using BEARS. When we computed histograms of the number of
reads per brick strand for each structure design, we observed a bimodal lognormal distribution, with the
rightmost peak corresponding to brick strands found in the structure and the leftmost peak
corresponding to reads from other brick strands in the molecular canvas. We used this distribution to
set thresholds for rendering the SRI data for each structure. SRI values are displayed on a color scale
ranging from 0 (blue) to 1 (yellow), which are the minimum and maximum possible SRI values,
respectively.

We  observed  good  correspondence  between  the  designs,  AFM  data,  and  the  thresholded  sequencing 
data  for  all  of  the  structures  (Fig.  3  a-f,  first  3  rows).  In  a  few  cases,  we  obtained  sequencing  reads  which 
did  not  correspond  to  brick  strands  found  in  a  particular  design.  Conversely,  we  failed  to  detect  fewer 
than  1%  of  brick  strands  in  any  given  structure  design.  Reads  corresponding  to  brick  strands  not  found  in 
a  shape  could  come  from  contamination  present  in  the  strand  mix,  or  introduced  during  sample 
processing.  Brick  strands  within  a  shape  that  were  not  detected  in  either  the  monomer  or  product  band 
by  sequencing  likely  have  poor  ligation  efficiency,  and  were  more  prevalent  in  samples  with  lower 
sequencing  depth.  For  the  bear  design,  there  is  a  single  missing  brick  strand  in  the  hindquarters  that  is 
observed  both  in  the  AFM  images  and  in  the  thresholded  sequencing  data  (Fig.  3f).  Together,  these  data 
indicate  that  BEARS  can  be  used  to  reconstruct  2D  DNA  brick  structures  with  single  component 
resolution, which can sometimes be difficult to resolve with AFM.

SRI  data  from  BEARS  are  recapitulated  at  the  single  structure  level  by  AFM  data.  After  analyzing  the  SRI 
data  (Fig.  3,  fourth  row),  we  noticed  that  certain  structures  had  some  areas  with  poor  incorporation 
(blue),  predominantly  found  near  the  edges  of  structures,  or  in  thinner  features  (for  example,  the  legs  of 
the bear in Fig. 3f). We were able to find a number of partially assembled structures which were missing
these poorly incorporated areas when imaging with the AFM (Supplementary Fig. 5). However, we did
not observe a correlation between GC content or strand ΔG and SRI (Supplementary Fig. 6). Since our
method involves gel-purifying assembled structures, we conclude that the SRI data represent a class
average of the structures present in the product band.

Using  BEARS  to  analyze  3D  DNA  brick  structures 

BEARS  can  be  used  to  analyze  3D  structures,  including  strands  on  the  inside  of  the  structures.  Such 
analysis  would  be  difficult  to  obtain  using  standard  TEM  methods.  We  assembled  a  3D  DNA  brick  cuboid 
with  a  tunnel  along  the  long  axis  (Fig.  4a),  imaged  the  product  band  using  TEM,  and  sequenced  the 
monomer and product bands using BEARS (Fig. 4b, c). We calculated the SRI of each brick strand as in
Fig. 2e and projected the incorporation data along each of the major axes of the structure (Fig. 4b, c).
The tunnel in the middle of the structure is clearly visible in the rendering from the appropriate
perspective (Fig. 4c), as well as a stripe of decreased intensity from left to right along the long axis in the
other two projections (Fig. 4b). This could indicate directional bias in DNA brick incorporation.
Furthermore, the spatial patterns of incorporation can be visualized by displaying slices through the long
axis of the structure (Fig. 4d, see Supplementary Fig. 7 for a larger image). Such data indicate that there
is  some  spatial  clustering  present  in  the  SRI  data.  In  particular,  the  right  edge  of  the  structure  contains 
two  layers  with  poor  SRI  (Fig.  4d,  blue  slices).  As  with  the  2D  DNA  brick  structures,  we  did  not  observe  a 
correlation  between  the  strand  GC  content  or  free  energy  and  the  SRI  measurements  (Supplementary 
Fig.  8).  These  data  highlight  the  power  of  a  label-free  approach,  as  they  include  many  brick  strands  in 
the  interior  of  the  structure  which  may  interfere  with  structure  assembly  when  labeled. 
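
A projection of per-strand SRI values along one axis of a 3D structure can be sketched as follows. The text does not state how values are combined along the projection axis, so averaging is an assumption here, as is the simplification that each brick strand maps to a single (x, y, z) lattice coordinate. The function name and the example values are hypothetical.

```python
from collections import defaultdict

def project_sri(sri_by_voxel, axis):
    """Average SRI values along one axis (0 = x, 1 = y, 2 = z).

    sri_by_voxel maps (x, y, z) coordinates to SRI values. Positions with no
    strand (for example, inside a tunnel) are simply absent from the mapping
    and therefore do not contribute to the average.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for coord, value in sri_by_voxel.items():
        # Drop the coordinate along the projection axis.
        key = tuple(c for i, c in enumerate(coord) if i != axis)
        sums[key] += value
        counts[key] += 1
    return {key: sums[key] / counts[key] for key in sums}

# Hypothetical SRI values at three lattice positions.
voxels = {(0, 0, 0): 1.0, (0, 0, 1): 0.5, (1, 0, 0): 0.2}
projection_z = project_sri(voxels, axis=2)
```

Averaging keeps the projected values on the same 0 to 1 scale as the SRI itself, whereas summing would make a designed cavity such as the tunnel appear as a region of low total intensity.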

Extending  BEARS  to  DNA  origami  structures 

BEARS can be used to calculate the SRI of staples in DNA origami structures. We assembled a 2D DNA
origami rectangle (Fig. 5a, b) and a 3D DNA origami cuboid (Fig. 5c-e), imaged the product bands using
AFM or TEM, respectively, and sequenced the monomer and product bands using BEARS. Most staples
had high SRI in the 2D origami rectangle, with a few exceptions (blue). For a larger image of the slices
of  the  3D  origami  cuboid,  see  Supplementary  Fig.  9.  As  with  the  DNA  brick  structures,  we  did  not 
observe  a  correlation  between  the  strand  GC  content  or  free  energy  and  the  SRI  measurements 
(Supplementary  Fig.  10).  Also,  we  observe  that  there  is  less  spatial  correlation  between  SRI  values  in 
origami  structures  compared  with  DNA  brick  structures.  Thus,  we  conclude  that  BEARS  can  be  extended 
to  measure  the  SRI  of  staples  in  DNA  origami  structures.  Given  this,  we  expect  that  BEARS  will  be 
extensible  to  a  wide  variety  of  DNA  nanostructures  with  uniquely  addressable  components. 

Discussion 

Here,  we  have  demonstrated  a  new  method  for  quantifying  the  component  composition  of  DNA 
nanostructures  called  BEARS.  BEARS  is  high-throughput,  label-free,  and  generates  data  that  correlate 
with AFM images of individual structures. We believe that BEARS is complementary to other structure
characterization methods, such as AFM/TEM imaging and gel-based labeling. In particular, one can use
BEARS  to  augment  the  resolution  of  a  structure  image,  if  one  is  not  able  to  obtain  component-level 
resolution  by  imaging.  Alternatively,  one  can  use  BEARS  to  screen  a  new  set  of  structure  designs,  and 
then  image  the  designs  with  higher  SRI  values  in  areas  of  interest  using  AFM  or  TEM.  In  these  ways,  we 
envision  that  BEARS  will  help  remove  some  of  the  existing  bottlenecks  in  structure  characterization, 
allowing  one  to  design,  build,  and  test  more  structures  than  was  previously  possible. 

One  use  case  for  BEARS  is  to  determine  the  weak  points  of  a  structure  design  and  improve  them  with  a 
new  design.  This  is  particularly  important  for  applications  in  which  guest  molecules  are  attached  at 
specific points on a structure; these should be chosen to have the highest SRI, or redesigned to
optimize the yield. At present, the incorporation data represent a class average of structures purified
from a product band on a gel. This provides a useful overview of which parts of a structure assemble
well and which do not, but it would also be interesting to get sequencing data at the single structure
level. One limitation of our method is that one needs a clear product band in order to ensure that the
reads one generates come from properly assembled structures. However, this limitation can likely be
overcome by fractionating an entire gel lane, or by using other methods for structure purification such as PEG
precipitation  or  glycerol  gradient  centrifugation43,44.  These  sorts  of  improvements  to  BEARS  will  be 
driven  by  decreases  in  the  cost  of  NGS,  enabling  one  to  sequence  more  samples  in  parallel,  with  higher 
depth  per  sample,  thereby  yielding  better  statistics  about  the  self-assembly  of  populations  of  structures. 

In addition to NGS, DNA synthesis is also experiencing a rapid decrease in price over time. One
technology which will hopefully accelerate this trend is chip-based DNA synthesis, which allows for the
production of tens of thousands of oligonucleotides at once45. These oligo libraries can then be used to
assemble large libraries of regular-sized structures, or larger structures themselves46. Larger structures
can  be  difficult  to  characterize  by  imaging  when  the  assembly  yield  is  low,  and  it  is  easy  to  damage  these 
structures  during  sample  preparation  and  processing.  Such  structures  are  also  of  interest  because  they 
tend to assemble with lower yield, which suggests that their assembly pathways may be
suboptimal. Using BEARS, one could perhaps not only get component-level resolution
renderings of these large structures, but also improve their yields by identifying and redesigning
the poorly incorporated regions. Overall, a combination of single component resolution, high
throughput, and built-in class averaging makes BEARS a promising method for characterizing a wide
variety of DNA nanostructures in the coming years.


Methods 


Structure  designs 


The 2D DNA brick structures shown in Fig. 3 are derived from the R6 canvas described in Wei et al.,
2012 (ref. 8). The letters A, B, E, R, and S were described in that work, whereas the bear structure was
designed  using  a  different  subset  of  the  R6  canvas.  The  3D  DNA  brick  structure  shown  in  Fig.  4  has  a  13 
nt  domain  length  (Ong  et  al.  unpublished).  The  2D  origami  rectangle  shown  in  Fig.  5  is  based  on  a  twist- 
corrected  version117  of  the  original  2D  origami  rectangle5,  but  with  a  different  scaffold  sequence32.  The 
3D  origami  cuboid  shown  in  Fig.  5  was  designed  using  caDNAno.  For  staple  or  brick  strand  sequences 
and  structure  design  schematics,  see  Supplementary  Data. 

Structure  assembly 

The letters B, E, A, R and the bear shape were assembled in assembly buffer (5 mM Tris-HCl, 1 mM
EDTA, adjusted to pH 8.0) supplemented with 25 mM MgCl2 using the following annealing protocol: 95
°C for 1 minute, anneal from 90 °C to 60 °C at 5 minutes per degree, then from 60 °C to 25 °C at 25
minutes per degree, followed by a hold at 25 °C. For AFM imaging, the letter S was assembled in
assembly buffer (5 mM Tris-HCl, 1 mM EDTA, adjusted to pH 8.0) supplemented with 25 mM MgCl2
using a modified annealing protocol: 95 °C for 1 minute, anneal from 90 °C to 60 °C at 5 minutes per
degree, then from 60 °C to 40 °C at 45 minutes per degree, followed by a hold at 25 °C.

For the 2D origami rectangle, scaffold (M13, purchased from Bayou Biolabs) and staples were mixed
together at target concentrations of 10 nM and 100 nM, respectively, in TAE (40 mM Tris acetate, 1 mM
EDTA) buffer with 12.5 mM magnesium acetate (TAE/Mg). For 2D origami folding, the mixtures were
kept at 90 °C for 5 min and annealed from 90 °C to 60 °C over the course of 30 min, from 60 °C to 45 °C
over the course of 90 min, and from 45 °C to 25 °C over the course of 20 min. For the 3D origami cuboid,
scaffold (p756013) and staple strands were mixed at 10 nM and 100 nM, respectively, in assembly buffer
(5 mM Tris-HCl, 1 mM EDTA, adjusted to pH 8.0) supplemented with 10 mM MgCl2 and annealed over 3
days using the following protocol: anneal from 80 °C to 60 °C at 2 minutes per degree, then from 60 °C
to 25 °C at 2 hours per degree. For the 3D DNA brick cuboid, brick strands were mixed together at 100
nM in assembly buffer (5 mM Tris-HCl, 1 mM EDTA, adjusted to pH 8.0) supplemented with 20 mM
MgCl2 and annealed over 3 days using the following protocol: anneal from 80 °C to 60 °C at 2 minutes
per degree, then from 60 °C to 25 °C at 2 hours per degree.
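
The total duration of the ramped annealing protocols above follows directly from the temperature ranges and the time spent per degree. The short Python sketch below works through that arithmetic for the 2D DNA brick protocol and for the 3D protocol shared by the origami and brick cuboids. The function and variable names are hypothetical, and the initial 95 °C step and the final hold are not included in the totals.

```python
def ramp_minutes(start_c, end_c, minutes_per_degree):
    """Duration of a linear ramp between two temperatures, in minutes."""
    return abs(start_c - end_c) * minutes_per_degree

def total_hours(ramps):
    """Total duration, in hours, of a list of (start, end, minutes/degree) ramps."""
    return sum(ramp_minutes(*ramp) for ramp in ramps) / 60

# 2D DNA brick shapes: 90 -> 60 C at 5 min/degree, then 60 -> 25 C at 25 min/degree.
brick_2d = [(90, 60, 5), (60, 25, 25)]

# 3D cuboids: 80 -> 60 C at 2 min/degree, then 60 -> 25 C at 2 hours/degree.
cuboid_3d = [(80, 60, 2), (60, 25, 120)]

hours_2d = total_hours(brick_2d)        # 1025 minutes, about 17 hours
days_3d = total_hours(cuboid_3d) / 24   # 4240 minutes, just under 3 days
```

The 3D result, just under three days, agrees with the "annealed over 3 days" description, which provides a quick check that the per-degree rates were read correctly.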

Gel  electrophoresis 

The 2D DNA brick shapes, 2D origami, and 3D origami were analyzed by electrophoresis in a native 1.5%
agarose gel supplemented with 10 mM MgCl2. Electrophoresis was performed at 90 V for 2 hours in an
ice-water bath. Gels were pre-stained with 1x Sybr Safe (Life Technologies). The 3D DNA brick cuboid
was analyzed by electrophoresis in a native 1% agarose gel supplemented with 10 mM MgCl2.
Electrophoresis was performed at 80 V for 2 hours in an ice-water bath. Gels were pre-stained with 1x
Sybr Safe (Life Technologies). Afterwards, gels were scanned with a Typhoon FLA 9000 (General Electric)
using the SYBR Safe channel (excitation at 473 nm, emission > 510 nm).

Gel  bands  were  visualized  using  a  Safe  Imager  2.0  Blue-Light  Transilluminator  (Invitrogen)  and  excised 
from  the  gel  using  a  fresh  razor  blade.  The  excised  piece  was  then  placed  into  a  Freeze  'N  Squeeze 
column  (Bio-Rad)  and  crushed  using  a  plastic  pestle  (USA  Scientific).  For  the  2D  DNA  brick  shapes,  2D 
origami  rectangle,  and  the  3D  origami  cuboid,  structures  were  eluted  from  the  column  by  centrifugation 
at  400  x  g  for  3  minutes.  For  the  3D  DNA  brick  cuboid,  structures  were  eluted  from  the  column  by 
centrifugation  at  1,200  x  g  for  3  minutes. 


Atomic  force  microscopy 


Images of folded structures were obtained with a Veeco Multimode V atomic force microscope. C-type
Bruker SNL-10 tips were used under tapping mode in fluid. Samples (25 µl) were deposited on the mica
surface for 1 minute. The mica surface was then rinsed 5 times with 0.5x TE (5 mM Tris, 1 mM EDTA,
adjusted to pH 8.0) supplemented with 25 mM MgCl2. For the 2D DNA brick shapes, samples were
supplemented with 5 mM NiCl2 (final concentration) to aid in attachment to the mica surface before
imaging. The 2D origami rectangle was imaged in 1x TE (10 mM Tris, 1 mM EDTA, adjusted to pH 8.0)
supplemented with 25 mM MgCl2.

Transmission  Electron  Microscopy 

2.5 µl of each sample was deposited onto glow-discharged, carbon-coated EM grids for 2 minutes. The
liquid was wicked off, and 2.5 µl of stain (2% uranyl formate + 25 mM NaOH) was added. The 3D DNA
brick structure was stained for 30 seconds, and the 3D origami cuboid was stained for 45 seconds. After
staining, excess liquid was wicked off. All samples were imaged using a JEOL JEM-1400 TEM operating at
80 kV.

Oligonucleotide  mixing  experiments 

For the experiments described in Fig. 2d, thirty oligonucleotides of length 80 nt were divided into 5
pools of 6 oligonucleotides each. These pools, 1-5, were mixed together with systematically varying
stoichiometry as follows: Mix A 1:1:1:1:1, Mix B 1:2:3:4:0, Mix C 0:1:2:3:4, Mix D 4:0:1:2:3, Mix E
3:4:0:1:2, Mix F 2:3:4:0:1. Thus, in each of Mixes B-F, each pool is present at 0%, 10%, 20%, 30%, or 40%
of the total mix. The mixes A-F were amplified using 2 cycles of PCR and sequenced.

For the experiments described in Fig. 2e, thirty oligonucleotides of length 42 nt were divided into 5
pools of 6 oligonucleotides each. These pools, 1-5, were mixed together with systematically varying
stoichiometry as follows: Mix A 1:1:1:1:1, Mix B 1:2:3:4:0, Mix C 0:1:2:3:4, Mix D 4:0:1:2:3, Mix E
3:4:0:1:2, Mix F 2:3:4:0:1. Thus, in each of Mixes B-F, each pool is present at 0%, 10%, 20%, 30%, or 40%
of the total mix. The mixes A-F were prepared for sequencing using the full BEARS protocol (see below).

Data were analyzed by calculating the average number of reads per oligo in each pool, then dividing the
read fractions in Mixes B-F by the read fractions calculated from Mix A, which contains each oligo pool
mixed with a 1:1:1:1:1 stoichiometry. The resulting normalized read fractions were compared with the
expected read fractions based on the mix stoichiometries. The data shown in Fig. 2d and 2e are the
mean normalized read fraction for each of the 5 pools at each expected read fraction, based on the
normalized data from Mixes B-F (see formula below). For example, for pools present at 10%, we expect
the mean read fraction to be 1.6%. Error bars were calculated by taking the standard deviation from
the 5 pools with a given expected read fraction.

NRF_B = RF_B / RF_A (shown for Mix B; Mixes C-F are normalized to Mix A in the same way)
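
The normalization to Mix A can be sketched in Python as follows. This is an illustrative sketch under stated assumptions: the per-pool read counts are hypothetical, the function names are ours, and the final rescaling of the normalized values so that they sum to one (to make them directly comparable with the expected fractions) is an assumption, since the text does not specify how the comparison was scaled.

```python
def expected_fractions(ratio):
    """Expected fraction of each pool from a mixing ratio such as (1, 2, 3, 4, 0)."""
    total = sum(ratio)
    return [r / total for r in ratio]

def read_fractions(counts):
    """Per-pool read counts divided by the total reads in the mix."""
    total = sum(counts)
    return [c / total for c in counts]

def normalized_read_fractions(mix_counts, reference_counts):
    """Divide each pool's read fraction by its read fraction in the equimolar mix."""
    rf_mix = read_fractions(mix_counts)
    rf_ref = read_fractions(reference_counts)
    return [m / r for m, r in zip(rf_mix, rf_ref)]

def rescale_to_unit_sum(values):
    """Rescale values to sum to one (assumed step, for comparison with expectations)."""
    total = sum(values)
    return [v / total for v in values]

# Hypothetical counts for pools 1-5. Mix A is equimolar, so its unequal counts
# reflect pool-specific bias only. Mix B follows the 1:2:3:4:0 design with the
# same per-pool bias applied.
mix_a_counts = [1200, 800, 1000, 900, 1100]
mix_b_counts = [1200, 1600, 3000, 3600, 0]

nrf_b = normalized_read_fractions(mix_b_counts, mix_a_counts)
rescaled = rescale_to_unit_sum(nrf_b)
expected = expected_fractions((1, 2, 3, 4, 0))
```

In this idealized example the pool-specific bias cancels exactly, so the rescaled values reproduce the designed 10%, 20%, 30%, 40%, and 0% fractions. With real data the cancellation is only approximate, which is what the error bars in Fig. 2d and 2e quantify.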

Sequencing  library  preparation 

Gel-purified monomer or product bands were concentrated using the Oligo Clean and Concentrator kit
(Zymo Research), eluting with 6 µl of milliQ H2O. Samples were denatured by heating to 95 °C for 5
minutes. The 3dC adapter sequence (see Supplementary Table 1 for details) was then ligated to the 3'
ends of the staples/brick strands using T4 RNA ligase 1 (New England Biolabs). Each 10 µl ligation
reaction contained 10 units of enzyme, 25% (w/v) PEG-8000, 1x T4 RNA ligase buffer, 5 pmol of the 3dC
adapter, 2-5 pmol of brick strands/staples, and ATP at a final concentration of 1 mM. Ligation reactions
were incubated overnight at room temperature, then heat-inactivated at 65 °C for 20 minutes.


After heat-inactivation, samples were analyzed using denaturing polyacrylamide gel electrophoresis
(Supplementary Fig. 2a). Samples were mixed 1:1 with 2x RNA loading dye (New England Biolabs),
denatured for 10 minutes at 70 °C, and loaded onto a precast 10% TBE-urea gel. Electrophoresis was
performed at 65 °C for 35-55 minutes at 180 volts using 0.5x TBE as a running buffer. Gels were post-
stained with 1x Sybr Gold (Invitrogen) for 30 minutes in an orbital shaker. Ligation product bands were
visualized using a Safe Imager 2.0 Blue-Light Transilluminator (Invitrogen), and excised from the gel
using razor blades or 1.1 x 6.5 mm gel cutting tips (MidSci).

Gel slices were placed into dialysis tubes (Slide-A-Lyzer MINI Dialysis Device, 2K MWCO, 0.1 mL) and 50
µl of 0.5x TBE was added to submerge the gel slices. Electroelution was performed at 90 V for 30
minutes in 0.5x TBE, followed by reversing the leads and running for ~30 seconds to prevent the DNA
from being stuck to the surface of the dialysis tube. Electroeluates were then concentrated using the
Oligo Clean and Concentrator kit (Zymo Research), eluting with 6 µl of milliQ H2O.

The gel-purified ligation product was phosphorylated using T4 polynucleotide kinase (New England
Biolabs). Phosphorylation reactions were carried out in a 10 µl reaction volume containing 5-10 units of
enzyme, 1x T4 PNK buffer, and 1 mM final concentration of ATP. Reactions were incubated for 30
minutes at 37 °C, then heat-inactivated at 65 °C for 20 minutes.

The 5bc adapter sequence (containing indexing barcodes, see Supplementary Table 1 for details) was
then ligated to the 5' ends of the staples/brick strands using T4 RNA ligase 1 (New England Biolabs).
Each 20 µl ligation reaction contained 2 µl of enzyme at 10 units/µl, 10 µl of 50% (w/v) PEG-8000 (final
concentration: 25% (w/v)), 2 µl of 10x T4 RNA ligase buffer, 5 pmol of the 5bc adapter (1 µl at 5 µM), 3 µl of
phosphorylated ligation 1 product, and 2 µl of 10 mM ATP (final concentration: 1 mM). Ligation
reactions were incubated overnight at room temperature, then heat-inactivated at 65 °C for 20 minutes.
We then purified the samples using the Oligo Clean and Concentrator kit (Zymo Research), eluting with 6-
10 µl of milliQ H2O.

Library  amplification  and  quantification 

Individual samples were amplified before pooling using Q5 polymerase (from 2x master mix purchased
from NEB) and previously validated Illumina qPCR primers at a final concentration of 300 nM. Between
2 and 10 µl of template was used in a 50 µl PCR reaction. Sequencing libraries were quantified using
quantitative PCR. Q5 polymerase (NEB) was used according to the manufacturer's instructions.

Previously validated Illumina qPCR primers42 were synthesized by IDT and used at a final concentration
of 300 nM. Syto13 (Molecular Probes / Life Technologies) was used as a fluorescent indicator dye
according to the manufacturer's instructions. DNA standards 1-6 from the Kapa NGS library
quantification kit were used to make a standard curve for absolute concentration determination (sample
qPCR data are shown in Supplementary Fig. 2b).

Next-generation  sequencing 

Libraries were sequenced using an Illumina MiSeq machine according to the manufacturer's instructions
using the MiSeq V2 paired end 50 kit (Illumina Inc., San Diego, CA). In some cases, we used a modified
library denaturation and loading protocol optimized for lower-concentration libraries42.

Data  analysis 

Sequence processing was done using custom MATLAB software (Supplementary xxx). Fastq files from
the MiSeq were parsed and partitioned based on the index reads. Sequence matching was done using
regular expressions to query for an exact match to the first 20 bases of a brick strand or staple,
allowing between 4 and 7 random nucleotides at the 5' end of the read. These random nucleotides are
part of the 5' sequencing adapter, and add sequence diversity to the first few cycles of sequencing
(necessary for the machine to focus properly). Data were filtered by disregarding strands with fewer
than 25 reads present in the product band, as there is not sufficient information to accurately
quantify the incorporation for these strands. Read counts were normalized to the overall number of
reads in a sample, resulting in "read fraction" data for both the product and monomer bands.
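The matching and normalization steps described above can be sketched as follows. This is a minimal Python illustration, not the custom MATLAB software used in the study: the function names, the toy strand sequences, and the choice to normalize by the number of reads passed in are all assumptions made for the example.

```python
import re
from collections import Counter

def count_strand_reads(reads, strands, min_random=4, max_random=7, prefix_len=20):
    """Count reads that start with 4-7 arbitrary bases followed by an exact
    match to the first `prefix_len` bases of a known strand."""
    patterns = {
        name: re.compile(
            "^[ACGTN]{%d,%d}%s" % (min_random, max_random, re.escape(seq[:prefix_len]))
        )
        for name, seq in strands.items()
    }
    counts = Counter()
    for read in reads:
        for name, pattern in patterns.items():
            if pattern.match(read):
                counts[name] += 1
                break  # assign each read to at most one strand
    return counts

def read_fractions(counts, total_reads):
    """Normalize per-strand read counts by the total number of reads in the sample."""
    return {name: n / total_reads for name, n in counts.items()}

# Toy data (hypothetical sequences, for illustration only).
strands = {
    "s1": "ATGGCTAGCCTTAGGCATCAGGTT",
    "s2": "CTGATGATGACCGTTAGCAATCGA",
}
reads = [
    "ACGT" + strands["s1"],       # 4 random bases: counted for s1
    "ACGTACG" + strands["s1"],    # 7 random bases: counted for s1
    "ACGTA" + strands["s2"],      # 5 random bases: counted for s2
    "AC" + strands["s1"],         # only 2 random bases: not counted
    "ACGTACGTA" + strands["s2"],  # 9 random bases: not counted
]
counts = count_strand_reads(reads, strands)
fractions = read_fractions(counts, total_reads=len(reads))
```

A strand with fewer than 25 reads in the product band would then be dropped before any further analysis, as stated in the text.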

After calculating the read fraction for each strand in a structure, we applied a threshold to determine
which strands were actually present in a structure and which were not. Specifically, we took the ratio
of the read fraction for each strand in the product band to the read fraction for the same strand in
the canvas mix. This resulted in a bimodal lognormal distribution for most shapes, since they contain a
subset of all of the strands in the canvas. By taking the ratio, we control for how well each
individual strand is ligated, amplified, and sequenced. Thresholds were determined in MATLAB by fitting
the sum of two Gaussian distributions to histograms (with 20 evenly spaced bins) of the log2 of the
read fraction ratio (product/canvas). Specifically, the threshold was taken as the location of the
minimum of the sum of the Gaussian fits, rounded up by the width of one histogram bin. Strands which
passed the threshold were analyzed further, as described below.
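As a rough illustration of the thresholding step, the sketch below bins log2 ratios into evenly spaced bins and, given parameters of an already fitted two-Gaussian model, locates the minimum of the summed curve between the two peaks. It is a Python sketch under stated assumptions, not the MATLAB code used in the study: the fit itself is not performed here, the function names are hypothetical, and "rounded up by one bin width" is interpreted as shifting the minimum upward by one bin width.

```python
import math

def histogram(values, n_bins=20):
    """Bin values into n_bins evenly spaced bins spanning [min, max].
    Assumes the values are not all identical."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for v in values:
        idx = min(int((v - lo) / width), n_bins - 1)  # put the maximum in the last bin
        counts[idx] += 1
    edges = [lo + i * width for i in range(n_bins + 1)]
    return counts, edges

def gaussian(x, amplitude, mean, sd):
    return amplitude * math.exp(-0.5 * ((x - mean) / sd) ** 2)

def threshold_from_fit(fit_low, fit_high, bin_width, n_grid=10001):
    """Find the minimum of the sum of two fitted Gaussians between their means,
    then shift it up by one bin width (an assumed reading of the rounding rule).
    Each fit is a tuple (amplitude, mean, sd) in log2-ratio units."""
    lo, hi = fit_low[1], fit_high[1]
    best_x, best_y = lo, float("inf")
    for k in range(n_grid):
        x = lo + (hi - lo) * k / (n_grid - 1)
        y = gaussian(x, *fit_low) + gaussian(x, *fit_high)
        if y < best_y:
            best_x, best_y = x, y
    return best_x + bin_width

# Hypothetical fit: absent strands centered at log2 ratio -4, present strands at 0.
threshold = threshold_from_fit((1.0, -4.0, 1.0), (1.0, 0.0, 1.0), bin_width=0.5)
```

Strands whose log2 (product/canvas) ratio exceeds `threshold` would be treated as present in the structure.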

SRI values were calculated as the read fraction in the product band divided by the sum of the read
fractions in the monomer and product bands:

$$\mathrm{SRI}_i = \frac{P_i / P_\mathrm{tot}}{P_i / P_\mathrm{tot} + M_i / M_\mathrm{tot}}$$

where $P_i$ and $M_i$ are the read counts for oligonucleotide $i$ in the product and monomer bands,
respectively, and $P_\mathrm{tot}$ and $M_\mathrm{tot}$ are the total read counts in the product and
monomer bands. We divide $P_i$ and $M_i$ by $P_\mathrm{tot}$ and $M_\mathrm{tot}$ respectively because
the total number of reads varies between samples; thus the read count is inherently a relative measure
rather than an absolute measure. This metric has two key assumptions. The first assumption is that the
variation in the starting concentrations of the oligonucleotides used for self-assembly is lower than
the variation expected to be found in the product band. Otherwise, the method will tend to measure
variation in the starting stoichiometry, rather than in the relative incorporability of DNA
oligonucleotides into the structures themselves. The second assumption is that the even distribution
of oligonucleotide stoichiometry is not influenced by aggregation. If aggregation is strand-specific,
then certain oligonucleotides might get sequestered preferentially in aggregates, thereby biasing the
incorporation measurements for those oligonucleotides. So long as these two assumptions hold, the SRI
is a good proxy for the relative incorporability of an oligonucleotide in a DNA nanostructure. We
expect the variation in starting stoichiometry to be small (~10%) based on spectrophotometric
measurements of oligo concentrations (data not shown). Also, we expect that aggregation will not be
strand-specific, as it is likely mediated by nonspecific interactions between oligonucleotides or
structures.
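The SRI calculation described above can be written compactly. The following is a minimal Python sketch, not the study's MATLAB code: the function name `strand_sri`, the use of summed per-strand counts as the band totals, and the toy read counts are assumptions made for illustration.

```python
def strand_sri(product_counts, monomer_counts, min_product_reads=25):
    """Compute SRI per strand: product read fraction divided by the sum of the
    product and monomer read fractions. Strands with fewer than
    `min_product_reads` reads in the product band are skipped."""
    p_tot = sum(product_counts.values())  # assumed stand-in for total product-band reads
    m_tot = sum(monomer_counts.values())  # assumed stand-in for total monomer-band reads
    result = {}
    for strand, p in product_counts.items():
        if p < min_product_reads:
            continue
        p_frac = p / p_tot
        m_frac = monomer_counts.get(strand, 0) / m_tot
        result[strand] = p_frac / (p_frac + m_frac)
    return result

# Toy read counts (hypothetical).
product = {"a": 60, "b": 30, "c": 10}
monomer = {"a": 20, "b": 30, "c": 50}
sri = strand_sri(product, monomer)
```

With these toy numbers, strand "a" is enriched in the product band (SRI of 0.75), strand "b" is evenly split (0.5), and strand "c" is dropped for having fewer than 25 product reads.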

Structure  rendering 

Coordinates of each strand were parsed from the caDNAno design files using custom MATLAB software,
and strands were colored based on the SRI of each brick strand or staple. Brick strands with fewer
than 25 reads in the product band are colored gray. Half-brick strands and edge protector brick
strands are also colored gray. Data are rendered using an inter-helix distance of 2 nm and 10.67 bp
per helical turn, based on the square lattice model.


Author  contributions 


CM  and  MB  conceived  of  the  project  and  designed  the  sequencing  protocol.  NH  and  LO  imaged  the 
structures.  CM,  NH,  JG,  and  performed  sequencing  experiments.  PY  supervised  the  project.  CM  wrote 
the  manuscript,  and  MB,  NH,  and  PY  edited  the  manuscript. 

Acknowledgements 

We acknowledge Omar Yaghi and Thomas Schaus for useful discussions. We acknowledge Sungwook
Woo for help with the 2D origami rectangle structures, and Sarah Boswell for help with MiSeq
sequencing. This work was funded by the NSF grant #1162459. CM is funded by the Fannie and John
Hertz Foundation, and NH by the German National Academic Foundation and German Academic
Exchange Service.


References 


1. Seeman, N. C. Nucleic acid junctions and lattices. J. Theor. Biol. 99, 237-47 (1982).

2.  Mao,  C.,  Sun,  W.  &  Seeman,  N.  C.  Assembly  of  Borromean  rings  from  DNA.  Nature  386,  137-8 
(1997). 

3. Winfree, E., Liu, F., Wenzler, L. A. & Seeman, N. C. Design and self-assembly of two-dimensional
DNA crystals. Nature 394, 539-44 (1998).

4. Yan, H., Park, S. H., Finkelstein, G., Reif, J. H. & LaBean, T. H. DNA-templated self-assembly of
protein arrays and highly conductive nanowires. Science 301, 1882-4 (2003).

5.  Rothemund,  P.  W.  K.  Folding  DNA  to  create  nanoscale  shapes  and  patterns.  Nature  440,  297-302 
(2006). 

6.  Han,  D.  et  al.  DNA  gridiron  nanostructures  based  on  four-arm  junctions.  Science  339, 1412-5 
(2013). 

7. Yin, P. et al. Programming DNA tube circumferences. Science 321, 824-6 (2008).

8. Wei, B., Dai, M. & Yin, P. Complex shapes self-assembled from single-stranded DNA tiles. Nature
485, 623-6 (2012).

9. Ke, Y., Ong, L. L., Shih, W. M. & Yin, P. Three-Dimensional Structures Self-Assembled from DNA
Bricks. Science 338, 1177-1183 (2012).

10.  Benson,  E.  et  al.  DNA  rendering  of  polyhedral  meshes  at  the  nanoscale.  Nature  523,  441-444 
(2015). 

11. Andersen, E. S. et al. Self-assembly of a nanoscale DNA box with a controllable lid. Nature 459,
73-6 (2009).

12. Dietz, H., Douglas, S. M. & Shih, W. M. Folding DNA into twisted and curved nanoscale shapes.
Science 325, 725-30 (2009).

13.  Douglas,  S.  M.  et  al.  Self-assembly  of  DNA  into  nanoscale  three-dimensional  shapes.  Nature  459, 
414-418  (2009). 

14. Douglas, S. M., Chou, J. J. & Shih, W. M. DNA-nanotube-induced alignment of membrane proteins
for NMR structure determination. Proc. Natl. Acad. Sci. USA 104, 6644-8 (2007).

15. Delebecque, C. J., Lindner, A. B., Silver, P. A. & Aldaye, F. A. Organization of intracellular
reactions with rationally designed RNA assemblies. Science 333, 470-4 (2011).

16.  Fu,  J.,  Liu,  M.,  Liu,  Y.,  Woodbury,  N.  W.  &  Yan,  H.  Interenzyme  substrate  diffusion  for  an  enzyme 
cascade  organized  on  spatially  addressable  DNA  nanostructures.  J.  Am.  Chem.  Soc.  134,  5516-9 
(2012). 

17. Liu, M. et al. A DNA tweezer-actuated enzyme nanoreactor. Nat. Commun. 4, 2127 (2013).

18.  Kuzyk,  A.  et  al.  DNA-based  self-assembly  of  chiral  plasmonic  nanostructures  with  tailored  optical 
response.  Nature  483,  311-4  (2012). 

19.  Schreiber,  R.  et  al.  Hierarchical  assembly  of  metal  nanoparticles,  quantum  dots  and  organic  dyes 
using  DNA  origami  scaffolds.  Nat.  Nanotechnol.  9,  74-78  (2013). 


20. Shen, X. et al. Rolling up gold nanoparticle-dressed DNA origami into three-dimensional plasmonic
chiral nanostructures. J. Am. Chem. Soc. 134, 146-149 (2012).

21. Lee, H. et al. Molecularly self-assembled nucleic acid nanoparticles for targeted in vivo siRNA
delivery. Nat. Nanotechnol. 7, 389-93 (2012).

22.  Liu,  X.  et  al.  A  DNA  nanostructure  platform  for  directed  assembly  of  synthetic  vaccines.  Nano 
Lett.  12, 4254-9  (2012). 

23. Douglas, S. M., Bachelet, I. & Church, G. M. A logic-gated nanorobot for targeted transport of
molecular payloads. Science 335, 831-4 (2012).

24.  Zadeh,  J.  N.  et  al.  NUPACK:  Analysis  and  design  of  nucleic  acid  systems.  J.  Comput.  Chem.  32, 
170-3  (2011). 

25.  Douglas,  S.  M.  et  al.  Rapid  prototyping  of  3D  DNA-origami  shapes  with  caDNAno.  Nucleic  Acids 
Res.  37,  5001-5006  (2009). 

26.  Castro,  C.  E.  et  al.  A  primer  to  scaffolded  DNA  origami.  Nat.  Methods  8,  221-229  (2011). 

27. Sobczak, J.-P. J., Martin, T. G., Gerling, T. & Dietz, H. Rapid Folding of DNA into Nanoscale Shapes
at Constant Temperature. Science 338, 1458-61 (2012).

28. Myhrvold, C., Dai, M., Silver, P. A. & Yin, P. Isothermal self-assembly of complex DNA structures
under diverse and biocompatible conditions. Nano Lett. 13, 4242-8 (2013).

29.  Bai,  X.-C.,  Martin,  T.  G.,  Scheres,  S.  H.  W.  &  Dietz,  H.  Cryo-EM  structure  of  a  3D  DNA-origami 
object.  Proc.  Natl.  Acad.  Sci.  U.  S.  A.  109, 20012-7  (2012). 

30.  Tian,  Y.  et  al.  Prescribed  nanoparticle  cluster  architectures  and  low-dimensional  arrays  built  using 
octahedral  DNA  origami  frames.  Nat.  Nanotechnol.  10,  637-644  (2015). 

31.  Jungmann,  R.  et  al.  Single-molecule  kinetics  and  super-resolution  microscopy  by  fluorescence 
imaging  of  transient  binding  on  DNA  origami.  Nano  Lett.  10, 4756-61  (2010). 

32.  Jungmann,  R.  et  al.  Multiplexed  3D  cellular  super-resolution  imaging  with  DNA-PAINT  and 
Exchange-PAINT.  Nat.  Methods  11,  313-8  (2014). 

33. Iinuma, R. et al. Polyhedra self-assembled from DNA tripods and characterized with 3D
DNA-PAINT. Science 344, 65-9 (2014).

34. Wagenbauer, K. F., Wachauf, C. H. & Dietz, H. Quantifying quality in DNA self-assembly. Nat.
Commun. 5, 3691 (2014).

35. Sadowski, J. P., Calvert, C. R., Zhang, D. Y., Pierce, N. A. & Yin, P. Developmental self-assembly of
a DNA tetrahedron. ACS Nano 8, 3251-3259 (2014).

36.  You,  M.  et  al.  Photon-Regulated  DNA-Enzymatic  Nanostructures  by  Molecular  Assembly.  ACS 
Nano  5, 10090-10095  (2011). 

37. Nagalakshmi, U. et al. The Transcriptional Landscape of the Yeast Genome Defined by RNA
Sequencing. Science 320, 1344-1349 (2008).

38.  Ingolia,  N.  T.,  Ghaemmaghami,  S.,  Newman,  J.  R.  S.  &  Weissman,  J.  S.  Genome-wide  analysis  in 
vivo  of  translation  with  nucleotide  resolution  using  ribosome  profiling.  Science  324,  218-23 
(2009). 


39.  Churchman,  L.  S.  &  Weissman,  J.  S.  Nascent  transcript  sequencing  visualizes  transcription  at 
nucleotide  resolution.  Nature  469, 368-373  (2011). 

40. Johnson, D. S., Mortazavi, A., Myers, R. M. & Wold, B. Genome-wide mapping of in vivo
protein-DNA interactions. Science 316, 1497-502 (2007).

41. Aird, D. et al. Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries.
Genome Biol. 12, R18 (2011).

42. Quail, M. A. et al. A large genome center's improvements to the Illumina sequencing system. Nat.
Methods 5, 1005-10 (2008).

43. Stahl, E., Martin, T. G., Praetorius, F. & Dietz, H. Facile and scalable preparation of pure and
dense DNA origami solutions. Angew. Chemie Int. Ed. 53, 12735-12740 (2014).

44.  Lin,  C.,  Perrault,  S.  D.,  Kwak,  M.,  Graf,  F.  &  Shih,  W.  M.  Purification  of  DNA-origami 
nanostructures  by  rate-zonal  centrifugation.  Nucleic  Acids  Res.  41,  (2013). 

45. Kosuri, S. et al. Scalable gene synthesis by selective amplification of DNA pools from high-fidelity
microchips. Nat. Biotechnol. 28, 1295-9 (2010).

46.  Marchi,  A.  N.,  Saaem,  I.,  Vogen,  B.  N.,  Brown,  S.  &  Labean,  T.  H.  Toward  larger  DNA  origami.  Nano 
Lett.  14,  5740-5747  (2014). 

47. Woo, S. & Rothemund, P. W. K. Programmable molecular recognition based on the geometry of
DNA nanostructures. Nat. Chem. 3, 620-627 (2011).


BEARS: Barcode Extension for Analysis and Reconstruction of Structures

[Figure 1 schematic. Panel a: 1. Self-assembly; 2. Gel purification; 3. Barcode ligation; 4. Pooling;
5. Sequencing. Panel b: Design; Read list (strand, monomer and product read counts); Thresholded
image; SRI map.]


Figure 1. BEARS uses sequencing to assay DNA nanostructure assembly. We show schematics of each of
the 5 major parts of BEARS: structure assembly, purification, barcode ligation, pooling, and
next-generation sequencing (a). Structures are assembled, and well-assembled products and
unincorporated monomers are isolated on an agarose gel. Barcodes and sequencing adapters are ligated
onto each sample in separate reactions, and then samples are pooled and sequenced using NGS. We also
show an overview of the data analysis pipeline used to generate a quantitative, spatial map of strand
incorporation (b). We start with a molecular canvas and a structure design containing a subset of the
DNA brick strands within the canvas. After self-assembly and sequencing, we have a list of read counts
for each brick strand in the product and monomer bands. We threshold the sequencing data to
determine which brick strands are part of the assembled structure and which are not (see Methods
section for details). Finally, we compute the SRI for each strand that passed the threshold (see
Methods for details).

Figure 2. Sequencing is reproducible and quantitative. We assembled and sequenced a 2D DNA brick
structure (the letter B) on two separate occasions using BEARS, and calculated the SRI for each
replicate experiment (a). A scatterplot of the SRI values from each replicate is shown (b). The
correlation coefficient between the two replicates is indicated in the upper left corner of the graph.
We performed oligo mixing experiments in which 5 pools containing 6 unique oligonucleotide species
each were mixed together with varying stoichiometries (c). We compared the expected and measured read
fractions of these oligonucleotide pools after PCR and NGS using pre-extended oligonucleotides (d) and
after ligation, PCR, and NGS (e). Error bars indicate one standard deviation based on measuring each
of the 5 pools with a given expected read fraction.

Figure 3. BEARS recapitulates the shape and SRI of tiles in 2D DNA brick structures. We tested 6
structures: the letter B (a), the letter E (b), the letter A (c), the letter R (d), the letter S (e),
and a bear shape (f). In each panel, we show design schematics (top row), 4 representative AFM images
of fully-assembled structures (second row), thresholded images generated from SRI data (third row)
and SRI data from BEARS (bottom row). Each box in the schematics and incorporation renderings
indicates one tile. In the AFM images, the scale bars are 50 nm long.


Figure 4. Extending BEARS to 3D DNA brick structures. We self-assembled a 3D DNA brick cuboid
structure containing a hole along the long axis (a). We show TEM images of the structure alongside SRI
data from BEARS projected along each of the 3 major dimensions using MATLAB (b, c). Scale bar: 100 nm
in (b), 10 nm in (c). It is not possible to distinguish the X-projections and Y-projections using the
TEM. We render incorporation data from BEARS in 3 dimensions for each layer of the structure using
MATLAB (d). The location and order of each layer is indicated using the dotted lines. Spacing between
adjacent helices is set to 2 nm, and each layer is approximately 8.1 nm thick.

Figure 5. Extending BEARS to 2D and 3D DNA origami structures. We show AFM images and BEARS
incorporation data for a 2D origami rectangle (a, b). Scale bar in (a): 100 nm. We show TEM images and
BEARS incorporation projections for a 3D origami cuboid (c, d). Scale bar: 50 nm in (c), 10 nm in (d).
It is not possible to distinguish the X-projections and Y-projections using the TEM. We render SRI
data from BEARS in 3 dimensions for slices through the 3D origami cuboid using MATLAB (e). The
location and order of each slice is indicated using the dotted lines. Spacing between adjacent helices
is set to 2 nm, and each slice is 5 nm thick.

DNA Origami [3] is a robust and programmable technique to make DNA nanostructures.
A key use of DNA origami structures has been as nanoscale breadboards that organize
components like small proteins, nanoparticles, small molecules and other DNA strands. Typical
DNA origami assembly requires that the scaffold strand be annealed with staple strands
in the appropriate buffer. The high temperatures (> 80 °C) used in this process are not
always compatible with temperature-sensitive components or contexts. In response, there
have been attempts to isothermally fold DNA. Studies have shown that DNA origami can
assemble isothermally if held at temperatures around 60 °C [4, 5], or at room temperature by
using denaturing agents [1, 2]. However, rapid assembly of DNA origami at room
temperature without the use of denaturing agents, which can be toxic or have other unintended
consequences, has not been achieved.

We assembled a rectangular DNA origami from a 1000 base long DeBruijn derived scaffold
sequence and a corresponding staple set. The origami assembles isothermally in as little as
five minutes at temperatures ranging from 25 °C to 50 °C.


Figure 1: Left: A random 1000 base four letter scaffold sequence has significant secondary structure.
Center: A three letter scaffold sequence has almost no secondary structure in the same conditions.
Analyzed with NUPACK. Right: DNA origami assembled from a 1000 base long DeBruijn derived
scaffold sequence and a corresponding staple set. The scaffold and staple strands were mixed
together and incubated at room temperature for five minutes in 1X TE buffer supplemented with
10 mM Mg++. Image size 1 µm x 1 µm.


Traditional DNA origami designs are faced with prohibitive kinetic barriers to assembly
at room temperature. Heating, or using denaturing agents, helps overcome these barriers.
Here, we take an alternate approach. We use computational sequence design to eliminate
prohibitive kinetic barriers and thus assemble DNA origami at room temperature in as little
as five minutes.


We conjectured that the key contributions to kinetic barriers are: i) secondary structure
of the scaffold, ii) secondary structure of staple strands and iii) unintended staple-staple
interactions. We circumvent these issues by the use of a three letter code. The scaffold
sequence is chosen from the three letter alphabet {A, T, C} and the staple sequences from the
complementary alphabet {A, T, G}. Mfold and NUPACK analysis of the resulting sequences
shows dramatic reduction in predicted secondary structure (Fig. 1) and unintended
staple-staple interactions through this strategy. A potential drawback of a three letter code is
reduced sequence diversity. A four letter code affords $4^8 \approx 65{,}000$ distinct 8-mers while a
three letter code affords only $3^8 \approx 6{,}500$. Thus, a poorly chosen three letter scaffold
sequence could have poor sequence diversity that results in unintended staple-scaffold interactions,
resulting in kinetic barriers to assembly. We attempt to minimize such bad choices by using
DeBruijn sequences to maximize sequence diversity. A DeBruijn sequence S of order n
over the three letter alphabet {A, T, C} is a sequence where every possible subsequence of
length n appears exactly once. DeBruijn sequences can be generated computationally in time
linear in the length of the sequence. We adapt DeBruijn sequences to filter out degenerate
subsequences and ensure that C content is evenly distributed across the sequence. We call
the resulting sequence DeBruijn derived and choose it as the scaffold sequence.
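To make the DeBruijn construction concrete, the sketch below generates a DeBruijn sequence over {A, T, C} with the standard Lyndon-word based recursive algorithm and checks that every n-mer occurs exactly once when the sequence is read cyclically. This is a generic textbook construction in Python, not the authors' design code: the function names are hypothetical, and the additional filtering of degenerate subsequences and balancing of C content described above are not reproduced.

```python
from collections import Counter

def de_bruijn(alphabet, n):
    """Return a cyclic DeBruijn sequence of order n over `alphabet`
    (length len(alphabet) ** n), built by concatenating Lyndon words."""
    k = len(alphabet)
    a = [0] * (k * n)
    sequence = []

    def db(t, p):
        if t > n:
            if n % p == 0:
                sequence.extend(a[1:p + 1])
        else:
            a[t] = a[t - p]
            db(t + 1, p)
            for j in range(a[t - p] + 1, k):
                a[t] = j
                db(t + 1, t)

    db(1, 1)
    return "".join(alphabet[i] for i in sequence)

def cyclic_kmer_counts(seq, n):
    """Count every length-n window of seq, wrapping around the end."""
    extended = seq + seq[:n - 1]
    return Counter(extended[i:i + n] for i in range(len(seq)))

scaffold_like = de_bruijn("ATC", 3)            # 27 bases, no G by construction
counts = cyclic_kmer_counts(scaffold_like, 3)  # each of the 27 3-mers appears once

# Sequence diversity of 8-mers for four and three letter codes.
four_letter_8mers = 4 ** 8   # 65536
three_letter_8mers = 3 ** 8  # 6561
```

A linear scaffold would be cut from such a cyclic sequence, so the n-mers spanning the wrap-around point are the ones to watch when adapting it.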

Quantitative super-resolution imaging with qPAINT

Zachary Feiger4, Avital Rodal4 & Peng Yin1,2

Counting  molecules  in  complexes  is  challenging,  even  with 
super-resolution  microscopy.  Here,  we  use  the  programmable 
and  specific  binding  of  dye-labeled  DNA  probes  to  count 
integer  numbers  of  targets.  This  method,  called  quantitative 
points  accumulation  in  nanoscale  topography  (qPAINT), 
works  independently  of  dye  photophysics  for  robust  counting 
with  high  precision  and  accuracy  over  a  wide  dynamic  range. 
qPAINT  was  benchmarked  on  DNA  nanostructures  and 
demonstrated  for  cellular  applications  by  quantifying  proteins 
in  situ  and  the  number  of  single-molecule  FISH  probes  bound 
to an mRNA target.

Optical super-resolution microscopy is revolutionizing the way
we study biology. It allows researchers to achieve a spatial
resolution below the diffraction limit of light1, providing insights
that were previously impossible2. Super-resolution studies often
focus on the visualization of synthetic or cellular structures with
sub-diffraction spatial resolution. However, data sets obtained
by stochastic switching and readout methods3-5 contain a wealth
of information that can be explored for quantitative studies6-13
beyond 'just' binning molecule localizations for spatial
visualization. These counting techniques typically rely on
complex modeling of blinking properties for target-bound fluorescent
dyes. However, with current methods it is difficult to accurately
and precisely quantify the number of target-bound fluorophores,
especially for a large number of fluorescent proteins or dyes in
dense clusters. First, the dyes typically have complex,
environmentally sensitive photophysics that are difficult to model. Different
switching properties for dissimilar dyes further complicate
multiplexed quantitative imaging14. Furthermore, irregular
sample illumination or varying excitation and activation intensities
can lead to variation in switching kinetics and thus inaccurate
quantification13. Additionally, target-bound dyes can be bleached
prematurely, before sufficient localizations are collected to allow
for accurate and precise quantification.


We introduce a simple and robust quantitative super-resolution
method called quantitative PAINT (or qPAINT) based on
DNA-PAINT16-19. Instead of relying on stochastic switching of
target-bound dyes3,4, DNA-PAINT achieves apparent blinking of targets
via transient binding of free-floating dye-labeled 'imager' strands to
complementary target-bound 'docking' strands. By analyzing the
predictable binding kinetics between the imager and docking strands,
qPAINT counts the number of targets without spatially resolving
them. As compared to existing quantification methods, qPAINT has
two unique features: it explicitly decouples blinking from dye
photophysics and it is immune to photobleaching (as dye-labeled imager
strands are continuously replenished from the solution).

Thanks to these features, qPAINT represents a conceptual
framework that can simultaneously achieve high accuracy, precision,
a wide dynamic range, robustness, and multiplexing capability
for quantifying the number of labeled targets. More specifically, by
explicitly avoiding analyzing the hard-to-predict and
illumination-dependent photophysical kinetics of dyes, qPAINT achieves high
quantification accuracy. Immunity to photobleaching permits
arbitrarily long imaging time and the collection of a large number
of blinking events, leading to high counting precision. The easily
adjustable influx rate of the imager strands makes such high
accuracy and precision achievable over a wide dynamic range.
Consistent, predictable, and easily calibratable (when required)
binding kinetics allows qPAINT to work robustly under diverse
conditions and with different dyes. Finally, because it decouples
the apparent blinking from the photophysical properties of dyes,
qPAINT is easily multiplexable (spectrally and sequentially17).
Figure 1 illustrates the principle of qPAINT with the example
of protein quantification in resolution-limited spots in a fixed
cell. The region in Figure 1a consists of one protein spaced about
200 nm away from a small cluster formed by three proteins (which
are spaced ~5 nm apart). Current super-resolution techniques
fall short of resolving individual proteins in the small cluster.
Thus, simple spatial counting would underestimate the total
number of proteins. However, by analyzing the predictable and
programmable binding kinetics of imager strands in DNA-PAINT
rather than spatially resolving individual targets, it is possible to
quantify integer numbers of molecules in these
resolution-limited areas. Single-molecule DNA hybridization and dissociation
can be described using a simple kinetic model with a
second-order association rate $k_\mathrm{on}$ and a first-order dissociation rate $k_\mathrm{off}$.
These kinetic constants determine the fluorescence on- and
off-times ($\tau_b$ for bright-time and $\tau_d$ for dark-time, respectively).
$\tau_b$ is linked to $k_\mathrm{off}$ via $\tau_b = k_\mathrm{off}^{-1}$, and $\tau_d$ is linked to the
influx rate of imager strands $\xi = k_\mathrm{on} \times c_i$ by
$\tau_d = (k_\mathrm{on} \times c_i)^{-1} = \xi^{-1}$, where

Figure 1 | qPAINT principle. (a) In DNA-PAINT, fluorescently labeled 'imager' strands
(P*) transiently bind from solution to complementary 'docking' strands (P) attached
to a target. Intensity vs. time traces show characteristic fluorescence on- and off-times
($\tau_b$ and $\tau_d$, respectively). qPAINT uses the predictable blinking kinetics to deduce
molecule numbers. (b) The number of binding sites can be calculated given a
known probe influx rate $\xi = k_\mathrm{on} \times c_i$. Stochastic simulations of DNA-PAINT binding
events show a linear relationship between simulated and 'measured' numbers of binding sites
(mean ± s.d.). The counting precision for a given number of sites is dependent on the probe influx
rate $\xi$ (green, $0.01\ \mathrm{s}^{-1}$; orange, $0.005\ \mathrm{s}^{-1}$).



C\  is  the  imager  strand  concentration.  If  a  single  protein  molecule 
labeled  yvith  a  single  docking  strand  "blinks* 1  with  a  frequency 
of  2  in  a  certain  time  interval  then  three  molecules  (containing 
three  docking  sites)  will  blink  with  three  times  the  frequency* 
given  a  constant  influx  rate  £  (Fig-  la)-  To  practically  quantify 
the  number  of  binding  sites  from  the  intensity  versus  time 
traces,  we  first  determine  the  mean  dark- time  Tj*  from  the  cumu¬ 
lative  distribution  function  (Supplementary  Fig*  1)  in  an  area 
of  interest,  and  then  calculate  ihe  number  of  binding  sites  as 
(kon  x  a  x  Td*)_1  =  (£  x  Td0_1. 
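The site-count relation $(\xi \times \tau_d^*)^{-1}$ can be illustrated with a minimal sketch. This is not the authors' analysis software; the function names and the example dark times (50 s and 5 s) are hypothetical and chosen only to make the arithmetic easy to follow. The sketch assumes the influx rate is calibrated from a target with a known number of docking sites.

```python
def calibrate_influx_rate(mean_dark_time_s, n_sites=1.0):
    """Influx rate xi (1/s) from a calibration target with a known number
    of docking sites: xi = 1 / (n_sites * tau_d*)."""
    if mean_dark_time_s <= 0 or n_sites <= 0:
        raise ValueError("dark time and site count must be positive")
    return 1.0 / (n_sites * mean_dark_time_s)


def binding_sites(mean_dark_time_s, influx_rate_per_s):
    """Number of binding sites in a region: (xi * tau_d*)^-1."""
    if mean_dark_time_s <= 0 or influx_rate_per_s <= 0:
        raise ValueError("dark time and influx rate must be positive")
    return 1.0 / (influx_rate_per_s * mean_dark_time_s)


# Hypothetical numbers, for illustration only.
xi = calibrate_influx_rate(50.0)   # isolated single site, mean dark time 50 s
n = binding_sites(5.0, xi)         # region of interest, mean dark time 5 s
```

With these illustrative inputs a tenfold shorter mean dark time corresponds to ten times as many sites, which is the proportionality the text describes.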

We first performed qPAINT in silico by stochastically simulating
DNA-PAINT data and plotting the results obtained by qPAINT versus
the 'true' molecule number used as input (Fig. 1b). The linear
relationship between the results and the true molecule number over
a wide range of binding sites shows that counting with qPAINT is
feasible (see Supplementary Fig. 2 for discussion regarding binding
frequency readout). qPAINT's counting precision ($1 - c_v$, with
$c_v = \sigma/\mu$ the coefficient of variation, and $\sigma$ and $\mu$
the s.d. and mean, respectively) can be increased by optimizing the
probe influx rate $\xi$ or extending image acquisition
(Supplementary Fig. 3).

Next, we experimentally benchmarked qPAINT in vitro and compared it
to our in silico results. We adopted a DNA origami20-based imaging
quality benchmark platform and an accompanying drift correction and
high-resolution image analysis method25. The origami was designed to
display 12 binding sites that can be visually separated using an
advanced image analysis method that can achieve ultra-high-resolution
imaging to discretely visualize each target in a densely packed
cluster21 (Fig. 2a, design details in Supplementary Fig. 4). The
visually counted ('ground truth') number of sites can be compared to
the qPAINT analysis for the same structures omitting any a priori
spatial information (Fig. 2b). High-resolution DNA-PAINT imaging
revealed that not all origami showed 12 binding sites (Fig. 2b; see
also Supplementary Fig. 5), likely resulting from missing docking
strands15 (see Supplementary Fig. 6 for incorporation efficiency
measurement). Comparing qPAINT in vitro and in silico (Fig. 2c) versus
the ground truth showed good agreement (90% precision, 97% accuracy).
Adjusting the imager influx rate to $\xi = 0.03$ s$^{-1}$ and extending
image acquisition time further increased the precision and accuracy to
95.4% and 99.6%, respectively (Fig. 2c and Supplementary Fig. 7). This
then allowed us to distinguish between integer numbers of sites (i.e.,
between 9, 10, 11, and 12; Fig. 2d).

While counting a few molecules is possible using stepwise
photobleaching approaches22 or photon statistics23, these approaches
are less effective for counting higher fluorophore densities24.
For this reason, we sought to demonstrate similar qPAINT performance
for a higher target density. Grouping four DNA origami from the
previous data set into one region of interest for qPAINT analysis
allowed us to assay qPAINT's performance for counting 48 sites
(Fig. 2e; note that spatial information was still disregarded). Again,
qPAINT versus ground truth analysis showed good agreement (Fig. 2f;
98% accuracy, 85% precision). Additionally, we performed qPAINT on DNA
origami with 12, 48, and 150 binding sites, respectively, yielding
similar performance (see Supplementary Figs. 4 and 8 for design and
results).

[Figure 2 axes: binding sites (visual) vs. binding sites (qPAINT)]

Figure 2 | qPAINT in vitro benchmarking. (a) DNA origami structure
with 12 designed docking sites. (b) DNA-PAINT image of the structures.
(c) Visual counting (x-axis), in silico simulation (gray), and in vitro
experimental qPAINT data (orange and green) are in good agreement
(97% accuracy and 90% precision for $\xi = 0.02$ s$^{-1}$ and 25 min
imaging time; 99.6% accuracy and 95.4% precision for $\xi = 0.03$
s$^{-1}$ and 166 min imaging time; error bars, 1 s.d.). (d) Distributions
plotted from data in c (green data points) demonstrate qPAINT's ability
to distinguish between integer numbers of binding sites (i.e., 9 vs. 10
vs. 11 vs. 12; Tukey post hoc test; F = 1,032.52, *P < 0.01,
Supplementary Fig. 7). (e) Dynamic range. Same DNA-PAINT data set as in
c (orange data points) reanalyzed by grouping four DNA origami
structures together. (f) Comparison between visual counting (x-axis)
and in vitro (orange) qPAINT analysis; error bars, 1 s.d. (g) In silico
analysis of the counting error (coefficient of variation, $c_v$)
dependency on the number of binding sites and imager strand influx
rate. Tuning $\xi$ (experimentally adjustable over a wide range) can
reach optimal conditions with low counting errors (<10%) for virtually
any number of binding sites. (h) Multiplexed qPAINT. Three distinct DNA
origami structures (similar to those in c) with orthogonal docking
strand sequences (red P1, green P3 and blue P5; error bars, 1 s.d.)
were imaged sequentially using Exchange-PAINT. Inset shows qPAINT
analysis on 11 binding sites structures. Scale bars: 100 nm (b,h),
500 nm (e).

Figure 3 | qPAINT in situ. (a-c) qPAINT in situ benchmarking with
Nup98. (a) DNA-PAINT image of Nup98 proteins in NPCs. Inset: labeling
and imaging schematic for NPC proteins. Single targets (marked with
arrows) are used for influx rate calibration. (b) NPC structures
displaying three, four, five, and six distinct Nup98 protein clusters
were respectively grouped for qPAINT analysis. (c) Comparison between
visual counting (x-axis) and qPAINT data (orange); error bars, 1 s.d.
(d,e) Brp qPAINT experiments. (d) DNA-PAINT image of Drosophila NMJs
obtained by using secondary DNA-antibody conjugates and primary
monoclonal antibodies against BrpNc82. Zoomed-in view (i, DNA-PAINT;
ii, diffraction limited) of the highlighted area in d showing two
separate CAZ units (assemblies of Brp proteins into multiprotein
clusters12). Green circles indicate single targets. (e) qPAINT
quantification (n = 981) indicating that the average number of Brp
molecules per CAZ unit is 142 ± 39 (mean ± s.d.). (f,g) smFISH qPAINT
experiments. (f) SUZ12 mRNA molecules are tagged using single-stranded
oligonucleotides with binding sequences unique to a part of the target
mRNA (r1*-r64*), a fixed Cy3B label and a single-stranded DNA-PAINT
docking strand (P1). (g) qPAINT quantification (n = 301) yields ~45
probes bound to a single mRNA molecule (~90 probes for two mRNAs),
revealing ~70% hybridization efficiency of the FISH probes to the mRNA
target. Scale bars: 500 nm (a, insets i and ii in d), 50 nm (b),
1 µm (d,f).

Next, we performed an in silico study to determine the counting
error for a given set of binding sites and imager strand influx rate
$\xi$ (Fig. 2g). The results show that $\xi$ can be tuned to select an
optimal 'working' point depending on the expected number of binding
sites, thus achieving high counting precision over a wide dynamic
range. Specifically, one should be able to obtain counting errors
below 10% independent of the number of binding sites. Finally, we
demonstrated the ability of qPAINT to count multiple target species
in the same sample with similar performance by performing
Exchange-PAINT17 on three distinct DNA origami structures in the
same sample (Fig. 2h) using orthogonal imager strand sequences.

Moving from a clean in vitro to a more complex in situ environment,
we next evaluated qPAINT on DNA origami structures bound to cell
surface proteins or microinjected into the nuclear and cytoplasmic
regions of fixed cells (Supplementary Fig. 9). In both cases,
qPAINT performed favorably, achieving accuracy and precision
similar to those for the in vitro DNA origami results. Comparing
nuclear and cytoplasmic counting results yielded similar numbers,
demonstrating the consistent binding kinetics of qPAINT in different
locations in fixed cells and suggesting its applicability, for
example, for counting proteins or DNA or RNA molecules in situ.

We next benchmarked qPAINT's performance in situ by quantifying the
number of individual units that form the symmetrical ring structure
of the nuclear pore complexes (NPCs) by targeting the nucleoporin
Nup98 (anchored mainly to the inner ring of the NPC35,26) in U2OS
cells using monoclonal primary antibodies directly conjugated to
docking strands (Fig. 3a). Due to variation of Nup98 protein units
present in each NPC and potentially imperfect antibody labeling
(Fig. 3a), not all complexes show the same number of individual
protein units (Fig. 3b and Supplementary Fig. 10). We first used
single isolated Nup98 protein clusters as calibration for the probe
influx rate. The association rate was determined by using the mean
value from the Gaussian fitting of the dark-time distribution from
the individual protein localizations (Fig. 3a). We subsequently
grouped NPC structures according to their number of visible Nup98
protein units. We then plotted for each group the distribution
obtained by qPAINT versus the visually determined ground truth and
obtained 95% accuracy and 84% precision (Fig. 3c).

We then applied qPAINT to quantify the number of single
Bruchpilot proteins (Brp), which are structural and functional
components of the cytomatrix at the synaptic active zone (CAZ)
in the Drosophila neuromuscular junction (NMJ)27. Here, we
used monoclonal primary antibodies (BrpNc82)12,27 and
DNA-conjugated secondary antibodies for labeling. Similarly to what
was done in a previous study12, we defined a CAZ unit as an
interconnected region of Brp molecules with an elliptical shape
(Fig. 3d, i, and Supplementary Fig. 11). Subsequent qPAINT
yielded 142 ± 39 (mean ± s.d.) Brp molecules per CAZ, consistent
with previously reported numbers using quantitative dSTORM12
(Fig. 3e). Note that more than one secondary antibody can be
bound per primary antibody. This, together with the variability
of antibodies bound per subunit, will lead to a larger 'labeling'
variability. However, unlike previous work involving elaborate
calibration using antibody titration12, qPAINT simply uses single
isolated targets to calibrate the influx rate as described above.

Finally, we applied qPAINT to quantify the number of probes
bound to an mRNA target (SUZ12) in single-molecule fluorescence
in situ hybridization (smFISH) experiments (Fig. 3f; see
Supplementary Fig. 12 for a larger view). We used a probe set of
64 smFISH probes28 consisting of a 5'-Cy3B label and a docking
strand on the 3' end (inset Fig. 3f). After cell fixation, labeling29
(see Online Methods for details), and imaging, colocalization
between Cy3B and DNA-PAINT was observed (Fig. 3f). Subsequent
qPAINT analysis yielded, for the first time to our knowledge, the
number of in situ-bound smFISH probes per mRNA molecule.
The bimodal distribution showed a first peak at ~45 and a second
peak at ~90, consistent with one and two mRNA molecules in a
resolution-limited area, respectively (Fig. 3g). Note that counting
fewer than the designed 64 smFISH strands is expected, as it is likely
that not all FISH probes were bound at each mRNA molecule. This
experiment thus indicates that ~70% of all designed smFISH probes
were bound to their targets, with a variance of ~36%.
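As a quick check on the reported efficiency, the ~70% value follows from the position of the first peak and the size of the designed probe set, and the second peak sits at twice the first, as expected for two mRNA molecules in one resolution-limited area:

```latex
\[
\text{hybridization efficiency} \approx
\frac{\text{probes counted per mRNA}}{\text{probes designed}}
= \frac{45}{64} \approx 0.70,
\qquad
2 \times 45 = 90 .
\]
```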

As qPAINT decouples blinking kinetics from dye photophysics
and is immune to photobleaching (see Supplementary Fig. 13),
it avoids typical 'undercounting' errors due to already
photobleached or 'inactive' dye labels, as well as 'overcounting' errors
due to blinking artifacts that are unaccounted for10-50. qPAINT
works robustly under diverse experimental conditions with
consistently high accuracy and precision (see Supplementary Fig. 14
for day-to-day reproducibility). Although qPAINT performs
robustly, it is still reliant on stoichiometric labeling of protein
targets. Imperfect labeling could potentially lead to undercounting.
Similarly to the approach taken with the previous quantitative
approach, dSTORM12, we opted to use standard immunostaining
to label endogenous protein targets with antibodies and we
obtained comparable quantification results, but with a simpler
in situ calibration using isolated single targets. For future
quantification of proteins in compact clusters, alternative labeling
techniques (for example, nanobodies31, aptamers32, modified amino
acids33, or small-molecule binders) need to be developed. In our
present work, we demonstrated qPAINT using DNA-PAINT, but
the general concept may be generalized to other PAINT methods
(for example, recently developed IRIS probes34).

METHODS 

Methods and any associated references are available in the online
version of the paper.

Note: Any Supplementary Information and Source Data files are available in the
online version of the paper.
ONLINE METHODS

Materials. Non-modified and amino-modified DNA oligonucleotides
were purchased from Integrated DNA Technologies
(Coralville, IA). Fluorescently labeled DNA oligonucleotides
were purchased from Biosynthesis (Lewisville, TX). Biotinylated
monoclonal antibody against EGF receptor was purchased from
Cell Signaling (Cat. No. 6627, Danvers, MA). Streptavidin was
purchased from Invitrogen (S-888, Carlsbad, CA). Bovine serum
albumin (BSA), BSA-Biotin and Triton X-100 were obtained from
Sigma-Aldrich (A8549, St. Louis, MO). Whole cell blue stain was
obtained from Thermo Scientific (8403501, Rockford, IL). Glass
slides and coverslips were purchased from VWR (Radnor, PA).
Lab-Tek II chambered coverglass was purchased from Thermo
Fisher Scientific (Billerica, MA). M13mp18 scaffold was
obtained from New England BioLabs (Ipswich, MA). Freeze 'N
Squeeze columns were ordered from Bio-Rad (Hercules, CA).
Paraformaldehyde and glutaraldehyde were obtained from
Electron Microscopy Sciences (Hatfield, PA). Protocatechuic
acid (PCA), protocatechuate-3,4-dioxygenase (PCD), and Trolox
(TX) were purchased from Sigma-Aldrich (37580-25G-F (PCA),
P8279-25UN (PCD), 238813-5G (TX)) (St. Louis, MO).

Four buffers were used for sample preparation and imaging:
buffer A (10 mM Tris-HCl, 100 mM NaCl, 0.05% Tween-20, pH
7.5), buffer B (5 mM Tris-HCl, 10 mM MgCl2, 1 mM EDTA,
0.05% Tween-20, pH 8), buffer B+ (5 mM Tris-HCl, 10 mM
MgCl2, 1 mM EDTA, 0.05% Tween-20, pH 8, supplemented with
1 mM PCA, 1 mM PCD, and 1 mM TX), and buffer C (1x PBS,
500 mM NaCl, pH 8).

Super-resolution optical setup. Fluorescence imaging was carried
out on an inverted Nikon Eclipse Ti microscope (Nikon
Instruments, Melville, NY) with the Perfect Focus System, applying
an objective-type TIRF configuration using a Nikon TIRF
illuminator with an oil-immersion objective (CFI Apo TIRF
100x, NA 1.49, Oil), corresponding to a final pixel size of 160 nm.
Three lasers were used for excitation: 488 nm (200 mW nominal,
Coherent Sapphire, Santa Clara, CA), 561 nm (200 mW nominal,
Coherent Sapphire) and 647 nm (300 mW nominal, MBP
Communications, Canada). The laser beam was passed through
cleanup filters (ZT488/10, ZET561/10, and ZET640/20, Chroma
Technology, Bellows Falls, VT) and coupled into the microscope
objective using a multi-band beam splitter (ZT488rdc/ZT561rdc/
ZT640rdc, Chroma Technology). Fluorescence light was spectrally
filtered with emission filters (ET525/50m, ET600/50m, and
ET700/75m, Chroma Technology) and imaged on an EMCCD
camera (iXon X3 DU-897, Andor Technologies, Northern Ireland).

Confocal setup. Confocal imaging was carried out on a Zeiss
Axio Observer with LSM 710 scanning laser confocal system
(Zeiss, Thornwood, NY) using the following excitation and emission
lines: Whole cell blue stain: 25 mW Argon laser (458/488/
514 nm) and 490-560 nm emission filter. Cy3: 20 mW DPSS laser
(561 nm) and 563-593 nm emission filter.

Data analysis. Super-resolution DNA-PAINT images were reconstructed
using custom-programmed software written in either
LabVIEW15-17 or MATLAB21 for spot-finding and 2D-Gaussian
fitting17. Drift correction and super-resolution imaging analysis
for the 20 nm grid origami sample were performed using custom
MATLAB software to visually separate the binding sites21.
Subsequent qPAINT analysis was performed using custom-written
software implemented in LabVIEW. Both reconstruction and
analysis programs are available upon request. After single-molecule
reconstruction, the qPAINT analysis software uses a
region of interest as input (either automatically or interactively
selected) for the binding kinetics analysis. The software determines
the mean dark time $\tau_d^*$ by first creating a cumulative distribution
function (cdf) from all dark times $t$ and subsequently fitting this
distribution to a single exponential function15:

$$P(t) = 1 - \exp(-t/\tau_d^*),$$

with $P$ representing the binding probability for a DNA-PAINT
imaging probe at time $t$ after a previous unbinding event.
Finally, after calibrating the imager probe influx rate
$\xi = k_{on} \times c_i$ from a known calibration sample (i.e.,
previously established from experiments under similar conditions or
in situ DNA origami calibration standards), the number of binding
sites in the selected region of interest is obtained by

$$\text{binding sites} = \frac{1}{k_{on} \times c_i \times \tau_d^*} = \frac{1}{\xi \times \tau_d^*}.$$
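The dark-time fit can be sketched as follows. This is an illustrative stand-in rather than the LabVIEW implementation: it assumes a linearized least-squares fit of the empirical cdf, $-\ln(1 - P) = t/\tau_d^*$, whereas the original software may use a different fitting routine. The function name, the midpoint convention for the empirical cdf, and the synthetic dark times are all assumptions made for this example.

```python
import math
import random


def mean_dark_time_from_cdf(dark_times):
    """Estimate tau_d* by fitting P(t) = 1 - exp(-t / tau) to the empirical cdf.

    The fit is linearized as -ln(1 - P) = t / tau and solved by least
    squares through the origin (regressing t on y = -ln(1 - P)).
    """
    ts = sorted(dark_times)
    n = len(ts)
    if n < 2:
        raise ValueError("need at least two dark times")
    num = 0.0
    den = 0.0
    for i, t in enumerate(ts):
        p = (i + 0.5) / n            # empirical cdf, midpoint convention (always < 1)
        y = -math.log(1.0 - p)
        num += t * y
        den += y * y
    return num / den                 # least-squares tau for the model t = tau * y


# Synthetic check: exponentially distributed dark times with a 5 s mean.
rng = random.Random(0)
synthetic = [rng.expovariate(1.0 / 5.0) for _ in range(5000)]
tau_fit = mean_dark_time_from_cdf(synthetic)
```

The fitted value can then be inserted into the binding-site formula above together with a calibrated influx rate.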

qPAINT data simulation. In silico qPAINT experiments
were performed using COPASI35. In brief, a bimolecular
chemical reaction model (modeling DNA hybridization and
dissociation reactions) with two species (docking and imager
strands) and two rate constants (association rate $k_{on}$ and
dissociation rate $k_{off}$) was used. Using a direct stochastic solver,
we performed time-lapsed simulations that yielded binarized
single-molecule intensity versus time traces. These traces were
then subjected to the same qPAINT analysis software that was
used to analyze the experimental data (see above). Simulation
parameters were selected in accordance with the respective
experimental conditions.
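A minimal direct-method (Gillespie-type) sketch of such a simulation is given below. It is not COPASI and not the authors' model file: the function name is hypothetical, the dissociation rate of 1 s$^{-1}$ is an arbitrary illustrative value, the mean dark time is taken as a plain average rather than a cdf fit, and the binarization into camera frames is omitted for brevity.

```python
import random


def simulate_dark_times(n_sites, influx_rate, k_off, total_time, rng):
    """Direct-method stochastic simulation of n_sites independent docking sites.

    Each free site binds an imager at rate influx_rate (xi = k_on * c_i) and
    each occupied site releases it at rate k_off. Returns the durations of
    all completed dark periods (no imager bound anywhere in the region).
    """
    t = 0.0
    bound = 0              # number of currently occupied sites
    dark_start = 0.0       # the simulation starts in the dark state
    dark_times = []
    while True:
        bind_rate = (n_sites - bound) * influx_rate
        unbind_rate = bound * k_off
        total_rate = bind_rate + unbind_rate
        t += rng.expovariate(total_rate)          # waiting time to the next event
        if t >= total_time:
            break
        if rng.random() * total_rate < bind_rate:
            if bound == 0:
                dark_times.append(t - dark_start)  # a dark period ends
            bound += 1
        else:
            bound -= 1
            if bound == 0:
                dark_start = t                     # a dark period begins
    return dark_times


# Illustrative run: 10 sites, xi = 0.01 1/s, hypothetical k_off = 1 1/s.
rng = random.Random(1)
darks = simulate_dark_times(10, 0.01, 1.0, 200_000.0, rng)
tau_d = sum(darks) / len(darks)
estimate = 1.0 / (0.01 * tau_d)   # qPAINT count recovered from simulated dark times
```

The run length here is much longer than the 15,000-frame simulations described in the text, purely so that the recovered count is close to the input; shorter runs scatter more widely around the true value.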

Simulation conditions: for the simulation in Figure 1b, two
probe influx rates (0.005 s$^{-1}$ and 0.01 s$^{-1}$) were used,
corresponding to 5 nM and 10 nM imager strand concentration at a
constant association rate $k_{on}$ of $10^6$ M$^{-1}$ s$^{-1}$.
Simulations were run for 15,000 "frames" at a sampling interval (or
"integration" time) of 0.1 s. 200 simulations per data point were
performed (mean and s.d. are plotted in Fig. 1b). More detailed
simulation conditions for simulations accompanying experimental data
are given in the following paragraphs.
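The quoted influx rates and the simulated duration follow directly from $\xi = k_{on} \times c_i$ and the frame settings:

```latex
\[
\xi = k_{on} \times c_i
    = 10^{6}\,\mathrm{M^{-1}\,s^{-1}} \times 5 \times 10^{-9}\,\mathrm{M}
    = 0.005\,\mathrm{s^{-1}},
\qquad
10^{6}\,\mathrm{M^{-1}\,s^{-1}} \times 10 \times 10^{-9}\,\mathrm{M}
    = 0.01\,\mathrm{s^{-1}},
\]
\[
T = 15{,}000 \times 0.1\,\mathrm{s} = 1{,}500\,\mathrm{s} = 25\,\mathrm{min}.
\]
```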

DNA origami self-assembly. DNA origami structures displaying
12, 44, 48 and 150 DNA-PAINT docking sites were self-assembled
in a one-pot reaction with 50 µl total volume containing 10 nM
scaffold strand M13mp18, 100 nM folding staples, 100 nM
biotinylated staples, and 1 µM DNA-PAINT docking staple strands in
folding buffer (1x TE Buffer with 12.5 mM MgCl2). The solution
was annealed using a thermal ramp cooling from 90 °C to 20 °C
over the course of 1.5 h. After self-assembly, structures were
purified by agarose gel electrophoresis (2% agarose, 0.5x TBE, 10 mM
MgCl2, 0.5x SybrSafe) at 4.5 V/cm for 1.5 h. Gel bands were cut,
crushed and filled into a Freeze 'N Squeeze column and spun
for 5 min at 800 x g at 4 °C. After this, structures were ready for
microscopy sample preparation and image acquisition.

DNA sequences. Sequences for 20 nm grid structures can be found in
Supplementary Table 1. Staple sequences for 12 docking sites
origami can be found in Supplementary Table 2. Staple sequences
for 48 docking sites origami can be found in Supplementary
Table 3. Staple sequences for 150 docking sites origami can be
found in Supplementary Table 4. Staple sequences for 20 nm grid
structures with fixed Cy3 dyes can be found in Supplementary
Table 5. Staple sequences for 44 docking sites origami with fixed
Cy3 dyes can be found in Supplementary Table 6. M13mp18
scaffold sequence for DNA origami structures can be found in
Supplementary Table 7. Sequences for FISH-PAINT probes can
be found in Supplementary Table 8. DNA-PAINT docking and
imager sequences and biotin docking sequence can be found in
Supplementary Table 9.

Sample preparation, imaging, and analysis of DNA origami
structures. Sample preparation. For sample preparation, a piece
of coverslip (No. 1.5, 18 x 18 mm2, ~0.17 mm thick) and a glass
slide (3 x 1 inch2, 1 mm thick) were sandwiched together by two
strips of double-sided tape to form a flow chamber with inner
volume of ~20 µl. First, 20 µl of biotin-labeled bovine albumin
(1 mg/ml, dissolved in buffer A) was flowed into the chamber and
incubated for 2 min. The chamber was then washed using 40 µl of
buffer A. 20 µl of streptavidin (0.5 mg/ml, dissolved in buffer A)
was then flowed through the chamber and allowed to bind for
2 min. After washing with 40 µl of buffer A and subsequently with
40 µl of buffer B, 20 µl of biotin-labeled DNA structures (~300 pM
concentration) in buffer B were finally flowed into the chamber
and incubated for 5 min. The chamber was washed using 40 µl of
buffer B+ for in vitro imaging of 20 nm grid structures and buffer B
for all other in vitro DNA origami experiments.

Imaging conditions. The imaging buffer contained 10 nM
(Fig. 2b,e) or 15 nM (Fig. 2c,d) Cy3B-labeled imager strands in
buffer B+, or 20 nM Atto 655-labeled imager strands in buffer B
(Supplementary Fig. 8). Imaging chambers were sealed with
epoxy before imaging. Image acquisition was carried out with
a CCD readout bandwidth of 3 MHz, at 14 bit and 5.1 pre-amp
gain. No EM gain was used. Imaging was performed using TIR
illumination with an excitation intensity of ~5 mW using the
561 nm laser line (Fig. 2b-e) or ~40 mW using the 647 nm laser line
(Supplementary Fig. 8). 15,000 frames at 10 Hz frame rate were
acquired (Fig. 2b,e,h and Supplementary Fig. 8). 50,000 frames at
5 Hz frame rate were acquired (Fig. 2c, green data points).

Data analysis. Super-resolution images were reconstructed
in either custom LabVIEW or MATLAB software as described
above. qPAINT analysis on DNA origami structures was performed
interactively by selecting a region of interest that was
large enough to fully comprise a single DNA origami (compare
gray "ROIs" (regions of interest) in Fig. 2b). First, 100 random
DNA origami structures displaying the same number of binding
sites (i.e., 10 spots) were used to calibrate the probe influx rate
$\xi$. For the imaging conditions used, probe influx rates of
0.02 s$^{-1}$ (Fig. 2c,f, orange data points, and Fig. 2h) and
0.03 s$^{-1}$ (Fig. 2c, green data points) were determined. These
influx rates were then used to perform qPAINT analysis on all DNA
origami structures in the sample. qPAINT analysis for the sample in
Supplementary Figure 8 was performed similarly. Here, the probe
influx rate was determined by assuming that the 12 binding site
origami structures carry on average 10.5 binding sites (determined by
visual counting of similar structures in Fig. 2b). Here, a probe
influx rate of 0.0201 s$^{-1}$ was determined for the imaging
conditions used in this experiment.

Simulation conditions. qPAINT simulations for Figure 2c
were performed using 9, 10, 11, or 12 binding sites as model
input in combination with association and dissociation rates
obtained from the experimental data. qPAINT simulations used
in Supplementary Figure 8 were performed in a similar fashion;
here, however, a normal distribution of binding sites with
means determined from the experimental data was used. The s.d. of
the normal distributions were scaled based on the simulated mean value,
taking the 12 binding site data from Figure 2b as input. qPAINT
simulations for Figure 2g were performed similarly to simulations
in Figure 1b. Here, binding sites ranging from 2 to 200 binding
sites and probe influx rates ranging from 0.001 s$^{-1}$ to 0.05 s$^{-1}$
were used to perform 200 stochastic simulations per value pair.
After analysis, the coefficient of variation ($c_v = \sigma/\mu$),
reflecting the counting error, is plotted against the number of
binding sites and probe influx rate. As before, 15,000 frames at an
"integration time" of 0.1 s are simulated.
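The counting error and the corresponding precision can be computed from a set of repeated counts as in the short sketch below. The function name and the example counts are hypothetical, and the use of the sample standard deviation is an assumption, since the text does not state which estimator was used.

```python
import statistics


def counting_error(counts):
    """Return (c_v, precision) with c_v = sigma / mu and precision = 1 - c_v."""
    mu = statistics.mean(counts)
    sigma = statistics.stdev(counts)   # sample s.d.; assumed, not specified in the text
    cv = sigma / mu
    return cv, 1.0 - cv


# Hypothetical counts from three repeated analyses of a 10-site structure.
cv, precision = counting_error([9, 10, 11])
```

For these illustrative counts the mean is 10 and the sample s.d. is 1, giving a counting error of 10% and a precision of 90%.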

Sample preparation, acquisition, and analysis of artificial
receptor clusters on cell surfaces. Sample preparation. CHO
cells were transiently transfected with the construct encoding a
human wild-type EGFR fused with the fluorescent protein mEos
(pQCEGFRmEOS; a kind gift from Jeff Werbin, Harvard Medical
School) using Lipofectamine (Invitrogen) according to the
manufacturer's instructions. Briefly, for each transfection reaction, 10 µl
Lipofectamine was incubated with 250 µl Opti-MEM for 10 min.
Next, 4 µg of EGFR-mEos vector plasmid was added and incubated
for 30 min. This mixture was added to cells and incubated
overnight. Transfections were carried out when CHO cells were at
90% confluence. After 24 h, cells were seeded at approximately 30%
confluence per well into Lab-Tek II chambers for 24 h before
fixation. EGF receptors were immunostained using the following
procedure: washing in PBS; fixation in a mixture of 3%
paraformaldehyde and 0.1% glutaraldehyde in PBS for 10 min; three
washes with PBS; reduction with ~1 mg/ml NaBH4 for 7 min;
three washes with PBS; permeabilization with 0.25% (v/v) Triton
X-100 in PBS for 10 min; three washes with PBS; blocking with
3% (w/v) bovine serum albumin for 30 min and staining overnight
with the biotinylated EGFR monoclonal antibody (diluted
to 10 µg/ml in 5% BSA); three washes with PBS; incubation with
20 streptavidin (dissolved in 5% BSA) for 30 min; three
washes with PBS; incubation with ~0.4 nM of biotinylated
Cy3-labeled DNA origami structures (dissolved in 5% BSA) for 1 h;
three washes with PBS. Finally, imager strands were added in
buffer C for image acquisition.

Imaging conditions. For surface-bound origami in Supplementary
Figure 9, 12 nM Atto 655-labeled imager strands in buffer C were
used. The CCD readout bandwidth was set to 5 MHz at 16 bit,
and 5.1 pre-amp gain with 255 EM gain was used. Imaging was
performed using highly inclined (HILO) illumination36 with an
excitation intensity of ~23 mW using the 647 nm laser line.

Data analysis. Super-resolution images were reconstructed
as described above. Here, ROI selection for qPAINT was performed
in a semi-automated fashion. In brief, the co-localization
between diffraction-limited and DNA-PAINT images was used to
select the structures (ROIs) bound to the membrane of the cells.
Then, diffraction-limited and super-resolved images were
aligned by computing the cross-correlation between them and
discarding non-co-localizing molecules. After selection, qPAINT
quantification was performed as described above. As before, the
probe influx rate was determined by assuming that the 12 binding
site origami structures carry on average 10.5 binding sites
(determined by visual counting of similar structures in Fig. 2b).
Here, a probe influx rate of 0.02163 s$^{-1}$ was determined, resulting
in an association rate of $k_{on} = 1.78 \times 10^6$ M$^{-1}$ s$^{-1}$
under these imaging conditions.

Sample preparation, acquisition, and analysis of DNA origami
microinjected into fixed cells. Sample preparation. DNA
origami structures with 44 binding sites for DNA-PAINT and
fixed Cy3-labeled strands were injected into fixed HeLa cells
using a Femtojet (Eppendorf, NY). Multiple injections per cell
were performed into nuclear and cytoplasmic regions with the
origami sample at ~1 nM concentration. The incorporation of the
structures inside the cells was confirmed by confocal microscopy
(Supplementary Video 1) and by deconvolution of a wide-field
image stack using Huygens Professional image processing software
(Supplementary Video 2).

Imaging conditions. For microinjected DNA origami in
Supplementary Figure 9, 5 nM Atto 655-labeled imager strands in
buffer C were used. The CCD readout bandwidth was set to
3 MHz at 14 bit, and 5.1 pre-amp gain with no EM gain was used.
Imaging was performed using HILO illumination with an excitation
intensity of ~50 mW using the 647 nm laser line.

Data analysis. Super-resolution DNA-PAINT images were
reconstructed as described above. For qPAINT analysis, DNA
origami structures in the cytoplasm were separated from
structures in the nucleus by using the DAPI signal for
segmentation (Supplementary Fig. 9). After structure selection,
quantification was done as described above in order to determine
hybridization kinetics for DNA-PAINT probes in nuclear and
cytoplasmic regions. For calculating the number of binding sites,
the same association rate determined in experiments on cell
surfaces (k_on = 1.78 × 10⁶ M⁻¹ s⁻¹) was used, resulting in a probe
influx rate of 0.0089 s⁻¹. We used the same apparent association
rate because imaging and buffer conditions are the same as in the
cell surface experiments.
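
The relation between the quoted numbers can be checked with a short sketch. It assumes the usual qPAINT relations, which this section only refers to as "described above": the probe influx rate is the association rate multiplied by the imager concentration, and the number of binding sites is the inverse of the product of mean dark time and influx rate. The function names and the 10 s dark time are illustrative, not taken from the original analysis code.

```python
def probe_influx_rate(k_on, imager_conc):
    """Probe influx rate in 1/s from k_on in 1/(M s) and concentration in M."""
    return k_on * imager_conc


def binding_sites(mean_dark_time, influx_rate):
    """Estimated number of binding sites from the mean dark time in s.

    Assumed qPAINT relation: n = 1 / (mean dark time * influx rate).
    """
    return 1.0 / (mean_dark_time * influx_rate)


# 5 nM imager strands with k_on = 1.78e6 1/(M s), as quoted in the text
rate = probe_influx_rate(1.78e6, 5e-9)
print(f"influx rate: {rate:.4f} 1/s")

# Hypothetical structure with a mean dark time of 10 s
print(f"binding sites: {binding_sites(10.0, rate):.1f}")
```

With 5 nM imager strands this reproduces the 0.0089 s⁻¹ influx rate stated above.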

Sample preparation, image acquisition, and analysis of qPAINT
applied to NPCs. Sample preparation. The nucleoporin Nup98
was labeled using a custom DNA-conjugated monoclonal
antibody. In brief, first a 5'-thiolated DNA-PAINT docking strand
(P1 sequence) was reduced with DTT and purified using NAP5
columns from GE Healthcare (Pittsburgh, PA). In a second step,
the monoclonal antibody (#2598, Cell Signaling) was cross-linked
with a 50x excess of a maleimide-PEG-succinimidyl
ester crosslinker (746223, Sigma-Aldrich). Excess linker that
did not react with the antibody was removed using Zeba columns
(89883, Life Technologies). Third, the antibody-linker was reacted
with the reduced 5'-thiolated DNA-PAINT docking strand at 4 °C
overnight. Finally, after conjugation, the oligo-labeled antibodies
were purified from unreacted DNA using 100 kDa Amicon Ultra
filters (UFC510096, EMD Millipore). The sample was then
characterized by mass spectrometry, determining that on average
each antibody is coupled with two docking strands.


U2OS cells were cultured in DMEM supplemented with 10%
FBS. Immunostaining was performed using a standard
protocol37. Cells were grown overnight on 8-well Lab-Tek Chambered
Coverglass. The cells were then rinsed for 20 s with 2.4% PFA in
PBS and extracted with 0.4% Triton X-100 in PBS for 3 min. Next,
the cells were fixed for 30 min with 2.4% PFA in PBS, quenched
for 5 min with 50 mM NH4Cl, then extensively washed with PBS
(3x, 10 min each), and blocked with 5% BSA in PBS (blocking
buffer) for 1 h. Subsequently, the cells were incubated overnight
at 4 °C with the DNA-conjugated monoclonal antibody freshly
diluted in the blocking buffer and then incubated with DAPI
plus drift markers for 5 min. Then cells were washed once with
buffer C before imaging and, finally, imager strands were added
in buffer C for image acquisition.

Imaging conditions. For Figure 3a, 5 nM Cy3B-labeled imager
strands in buffer C were used. The CCD readout bandwidth was
set to 3 MHz at 14 bit and 5.1 pre-amp gain with no EM gain.
Imaging was performed using HILO illumination with an
excitation intensity of ~8 mW using the 561 nm laser line. 15,000
frames at 5 Hz frame rate were acquired.

Data analysis. Super-resolution DNA-PAINT images were
reconstructed as described above. For qPAINT analysis, NPC
structures in the nucleus were masked from the signal in the
cytoplasm using the DAPI signal. Single Nup98 proteins or
complete structures were masked using particle filter and automatic
structure selection. Quantification was done as described above
in order to determine hybridization kinetics for DNA-PAINT
probes. The association rate was determined by using the mean
value from the Gaussian fitting of the dark time distribution from
the individual protein locations (see arrows in Fig. 3a) to be
k_on = 1.75 × 10⁶ M⁻¹ s⁻¹, resulting in a probe influx rate of
0.00875 s⁻¹. This value was then used to perform all qPAINT
calculations.

Distances between individual Nup98 proteins in an NPC
were determined by first calculating the coordinates of the center
of each structure using a circular Hough transform. Then we
projected the histograms for the cluster of localizations from the
center, forming a linearized intensity profile with a given number
of peaks (3-7 spots, see Supplementary Fig. 10). Distances
between spots were calculated by fitting the histograms with a
minimal number of Gaussians and calculating the intervals between
the peaks. This routine was implemented using custom-written
MATLAB software.

Sample preparation, image acquisition, and analysis of qPAINT
for quantifying Brp proteins in CAZ-units. Sample preparation.
The Brp protein was labeled using a monoclonal antibody Brp^Nc82
(nc82, DSHB). For DNA-PAINT, we used custom DNA-labeled goat
anti-mouse secondary antibodies. The secondary antibody
was labeled using the same protocol used for the conjugation of
the Nup98 antibody.

Drosophila crawling third instar larvae fillets were prepared
by dissection in ice-cold HL-3.1 and fixed for 10 min using 4%
paraformaldehyde in HL-3.1. The fillets were then permeabilized
with 3 washes of 15 min each in PBT (PBS with 0.05%
Triton X-100) followed by a 30 min block in PBT containing 1%
normal goat serum and 2% BSA. Preparations were incubated
with primary antibodies at 4 °C overnight. After one short and
three ~20 min washing steps, the fillets were incubated with
secondary DNA-conjugated antibodies for 2 h followed by another
three washing steps. Then fillets were washed once with buffer C
before imaging and, finally, imager strands were added in buffer
C for image acquisition.

Imaging conditions. For Figure 3d, 10 nM Cy3B-labeled imager
strands in buffer C were used. The CCD readout bandwidth was
set to 3 MHz at 14 bit and 5.1 pre-amp gain with no EM gain.
Imaging was performed using HILO illumination with an
excitation intensity of ~8 mW using the 561 nm laser line. 15,000
frames at 10 Hz frame rate were acquired.

Data analysis. Super-resolution DNA-PAINT images were
reconstructed as described above. For qPAINT analysis, DNA-PAINT
signal from synaptic boutons on neuromuscular junctions
(NMJ) was masked using anti-horseradish peroxidase
(HRP-A488). Single Brp molecules or complete CAZ-units were
masked using particle filter and automatic structure selection.
Quantification was done as described above in order to determine
hybridization kinetics for DNA-PAINT probes. The association
rate was determined by using the mean value from the Gaussian
fitting of the dark time distribution from the individual Brp
molecule locations (see Fig. 3d, green circles), resulting in a probe
influx rate of 0.0023 s⁻¹. This value was then used to perform all
qPAINT calculations on individual CAZ-units. This routine was
implemented using custom-written MATLAB software.

Sample preparation, acquisition, and analysis of smRNA-FISH
in fixed cells. Sample preparation. SUZ12 mRNA molecules
were tagged in fixed HeLa cells using single-stranded
oligonucleotides in a FISH-like hybridization scheme. mRNAs were
labeled with a unique set of 64 single-stranded DNA oligonucleotides
(Supplementary Table 8). Each probe consists of a unique
mRNA-binding sequence (20 nucleotides long) with Cy3B coupled to
the 5'-end; the 3'-end carries a single-stranded extension for
DNA-PAINT (Fig. 3f). For coupling, the oligonucleotides were ordered
with 5'-amino modifications from IDT (Coralville, IA). Cy3B was
ordered as a succinimidyl ester derivative from GE Healthcare
(Pittsburgh, PA). After coupling, the dye-labeled DNA strands
were purified by using high pressure liquid chromatography23.
To perform RNA-FISH, we followed a standard protocol29.
Cells were fixed with pre-chilled methanol (-20 °C) for 30 min.
Following fixation, cells were hybridized with the probes at
16 mM each in hybridization buffer consisting of 10% formamide,
2x SSC, and 10% dextran sulfate (w/v). Samples were then
hybridized overnight in a humidified chamber at 37 °C. Following
hybridization, samples were washed twice with wash buffer
(2x SSC with 10% formamide) for 30 min at 37 °C, and then
incubated with DAPI plus drift markers for 5 min. Then cells were
washed once with buffer C before imaging and, finally, imager
strands were added in buffer C for image acquisition.


Imaging conditions. For Figure 3f, 5 nM Atto 655-labeled imager
strands in buffer C were used. The CCD readout bandwidth was
set to 3 MHz at 14 bit, and 5.1 pre-amp gain with no EM gain was
used. Imaging was performed using HILO illumination with an
excitation intensity of ~50 mW using the 647 nm laser line.

Image analysis. Super-resolution DNA-PAINT images were
reconstructed as described above. Here, brightfield and DAPI
images were used to identify individual cells. Diffraction-limited
Cy3B and DNA-PAINT images were used to detect RNA
transcripts. For image processing, a binary brightfield image was
merged with a binary DAPI image. The resulting mask was used
to segment the cells. Diffraction-limited spots were detected using
a script that operates as follows. First, a median filter followed by a
Laplacian filter was applied to each image. Then, a threshold was
selected to detect individual spots. This procedure enabled us to
identify the total number of diffraction-limited spots within each
cell. For quantification, diffraction-limited and super-resolved
images were aligned by computing the cross-correlation
between them. After selection, qPAINT quantification was
performed as described above for each individual spot in order to
determine the number of probes bound to each individual mRNA.
For calculating the number of binding sites, the same association
rate determined previously (k_on = 1.78 × 10⁶ M⁻¹ s⁻¹) was used,
resulting in a probe influx rate of 0.0089 s⁻¹. We used the same
apparent association rate because imaging and buffer conditions
are the same as in the experiments in Supplementary Figure 9.
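
The spot-detection steps (median filter, Laplacian filter, threshold) can be sketched as follows. This is a minimal illustration in plain Python, not the script used in the study: the 3x3 kernel size, the 4-neighbour Laplacian, the 8-connected region counting and all function names are assumptions made for the example.

```python
from statistics import median


def median3(img):
    """3x3 median filter; border pixels use only the neighbours that exist."""
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = [img[j][i]
                      for j in range(max(0, y - 1), min(h, y + 2))
                      for i in range(max(0, x - 1), min(w, x + 2))]
            out[y][x] = median(window)
    return out


def neg_laplacian(img):
    """Negated 4-neighbour Laplacian, so bright spots give positive values."""
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            out[y][x] = (4 * img[y][x] - img[y - 1][x] - img[y + 1][x]
                         - img[y][x - 1] - img[y][x + 1])
    return out


def count_spots(img, threshold):
    """Count 8-connected regions whose filtered response exceeds threshold."""
    resp = neg_laplacian(median3(img))
    h, w = len(resp), len(resp[0])
    seen = [[False] * w for _ in range(h)]
    n_spots = 0
    for y in range(h):
        for x in range(w):
            if resp[y][x] <= threshold or seen[y][x]:
                continue
            n_spots += 1
            seen[y][x] = True
            stack = [(y, x)]
            while stack:  # flood-fill the current region
                cy, cx = stack.pop()
                for ny in range(max(0, cy - 1), min(h, cy + 2)):
                    for nx in range(max(0, cx - 1), min(w, cx + 2)):
                        if not seen[ny][nx] and resp[ny][nx] > threshold:
                            seen[ny][nx] = True
                            stack.append((ny, nx))
    return n_spots
```

The median step suppresses isolated hot pixels before the Laplacian emphasizes local intensity changes, so a single-pixel outlier is not counted as a spot.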

Drift correction. In vitro imaging. Drift correction was performed
with custom-written MATLAB software21. The positions of all
DNA origami structures were tracked throughout the duration
of each movie and averaged for use as the drift correction trace.
In the 20 nm grid super-resolution image (Fig. 2b), an advanced
drift correction algorithm21 was applied to visually separate
the individual grid points.

In situ imaging. For cellular imaging, 100 nm gold nanoparticles
(Sigma-Aldrich; 10 nM in buffer C, added before imaging) were used
as drift markers. The gold nanoparticles adsorb non-specifically
to the glass bottom of the imaging chambers. Drift correction
is performed in a similar fashion as for the in vitro imaging (see
above). The apparent "movement" of all gold nanoparticles in a
field of view is tracked throughout the movie. The obtained
trajectories are then averaged and used for global drift correction of
the final super-resolution image.
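
The averaging scheme described above can be sketched as below. This is an illustrative outline rather than the MATLAB implementation used in the study; the data layout (one list of per-frame (x, y) positions per marker, and localizations as (frame, x, y) tuples) and the function names are assumptions.

```python
def drift_trace(marker_tracks):
    """Mean displacement of all markers relative to frame 0, per frame.

    marker_tracks: list of tracks, each a list of (x, y), one entry per frame.
    """
    n_markers = len(marker_tracks)
    n_frames = len(marker_tracks[0])
    trace = []
    for f in range(n_frames):
        dx = sum(t[f][0] - t[0][0] for t in marker_tracks) / n_markers
        dy = sum(t[f][1] - t[0][1] for t in marker_tracks) / n_markers
        trace.append((dx, dy))
    return trace


def correct_drift(localizations, trace):
    """Subtract the drift of the corresponding frame from each localization.

    localizations: list of (frame, x, y) tuples.
    """
    return [(f, x - trace[f][0], y - trace[f][1]) for f, x, y in localizations]
```

Averaging over several markers reduces the contribution of each marker's own localization error to the drift trace.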

35. Hoops, S. et al. Bioinformatics 22, 3067-3074 (2006).

36. Tokunaga, M., Imamoto, N. & Sakata-Sogawa, K. Nat. Methods 5, 159-161
(2008).

37. Szymborska, A. et al. Science 341, 655-658 (2013).

Optical imaging of individual biomolecules
in densely packed clusters

Mingjie Dai1,2, Ralf Jungmann1,3† and Peng Yin1,3*

Recent advances in fluorescence super-resolution microscopy have allowed subcellular features and synthetic nanostructures
down to 10-20 nm in size to be imaged. However, the direct optical observation of individual molecular targets (~5 nm) in
a densely packed biomolecular cluster remains a challenge. Here, we show that such discrete molecular imaging is possible
using DNA-PAINT (points accumulation for imaging in nanoscale topography), a super-resolution fluorescence microscopy
technique that exploits programmable transient oligonucleotide hybridization, on synthetic DNA nanostructures. We
examined the effects of a high photon count, high blinking statistics and an appropriate blinking duty cycle on imaging
quality, and developed a software-based drift correction method that achieves <1 nm residual drift (root mean squared)
over hours. This allowed us to image a densely packed triangular lattice pattern with ~5 nm point-to-point distance and to
analyse the DNA origami structural offset with angstrom-level precision (2 Å) from single-molecule studies. By combining
the approach with multiplexed exchange-PAINT imaging, we further demonstrated an optical nanodisplay with 5 × 5 nm
pixel size and three distinct colours with <1 nm cross-channel registration accuracy.

Biological and synthetic biomolecular systems exhibit complex
structures at the nanoscale. Understanding the spatial
arrangement of their individual components is critical for unravelling
the molecular mechanism that underlies complex molecular
behaviour (Fig. 1a). Super-resolution fluorescence techniques have
bypassed the traditional diffraction limit and demonstrated
imaging resolution down to 10-20 nm (refs 1-13). In particular,
single-molecule localization microscopy (SMLM) builds up
super-resolution images from single-emitter localizations and typically
achieves photon-limited localization precision (down to ~1 nm)
for single-emitter blinking events13-15. Previous single-molecule
and SMLM studies separately demonstrated single-target
visualization in isolation or in sparse arrangements16,17, and high
localization precision compatible with molecular-scale resolution18-24.
However, the discrete visualization and precise localization of each
individual molecular target (~5 nm) in a densely packed biomolecular
cluster, which we refer to as 'discrete molecular imaging' (DMI),
remains difficult.

Several factors limit the performance of current super-resolution
techniques, such as the finite fluorophore photon budget,
unsatisfactory fluorophore imaging efficiency, or limited control over
target blinking kinetics9,22,25-30. These restrictions respectively
translate to a limited photon count per localization, a limited number
of blinking events per target and a high fraction of false
localizations, which ultimately restrict the final imaging resolution,
signal-to-noise ratio and the visualization of individual targets
within dense clusters. Moreover, nanometre-level accuracy in stage
noise31 and drift compensation is critical for high imaging
resolution and quality.

We addressed the above challenge and demonstrated DMI using
the DNA-PAINT super-resolution method t,U33UJ3 (Supplementary
Fig. 1). DNA-PAINT, a variation of PAINT9, exploits the transient
binding of fluorophore-labelled imager strands to target-bound
docking strands to achieve the necessary blinking for
super-resolution reconstruction10 (Fig. 1b; Supplementary Fig. 2). The
continuous replenishment of imager strands renders DNA-PAINT
immune to photobleaching, allowing high localization precision
by extracting a large number of photons per single-molecule
localization and a high target separability by collecting a large
number of blinking events from each target. Additionally, due to
independent and programmable control of blinking on- and
off-rates, DNA-PAINT permits low imaging background in dense
clusters from an appropriately adjusted blinking duty cycle based on
the target density (Fig. 1c).

We developed a framework for achieving DMI in the context of
localization microscopy, including the technical requirements and
quality assay methods (Supplementary Fig. 1). Specifically, we
introduced an image-based assay for measuring the localization
precision and the maximally achievable resolution, a target
signal-to-noise ratio assay for measuring single-target separation and a
method for estimating the fraction of false double-blinking
localizations. We systematically studied their effects, and demonstrated
stringent control for each of them with DNA-PAINT. In particular, we
achieved a high localization precision (<1 nm single-molecule
fitting precision, from up to 50,000 photons per single-molecule
localization), a high target separability (from ~80 blinking events
per target), a low imaging background using appropriately tuned
blinking duty cycles and high-accuracy (<1 nm root mean
squared (r.m.s.) over hours of imaging) microscope stage drift
correction with a novel method based on synthetic nanostructure
drift markers with designed geometric patterns. Finally, we used
DMI to visualize individual targets in a compactly labelled
molecular grid of targets (with a point-to-point spacing of ~5 nm) and
demonstrated multiplexed DMI on a three-colour nanodisplay
board with ~5 nm pixels.

Technical  requirements  for  DMI 

The technical requirements for DMI depend on the spatial
distribution of the targets, particularly the closest spacing between
targets and their local density26,34 (within a diffraction-limited
region, see Supplementary Note 7.1). Even with only two targets,
a high localization precision that allows a full width at half
maximum (FWHM) resolution equal to or smaller than the
spacing between them is necessary but not sufficient for their clear
separation23,34,35. Targets arranged in dense clusters impose even more
stringent requirements. Although general requirements and quality
guidelines for super-resolution imaging have been discussed26,27^11*3*, a
systematic formulation of the technical requirements for DMI and
its quality control methods is still lacking.

Figure 1 | Principle and requirements of DMI. a, The concept of super-resolution DMI, illustrated with a point array representation (blue dots represent
individual molecular targets, yellow dots represent chemical modifications). Left: A regular 16-component biomolecular complex. Right: Its various structural
and chemical variations. b, Illustration of the DNA-PAINT principle: transient binding between a docking strand and dye-conjugated imager strands (top) on
a synthetic DNA nanostructure, where each cylinder represents a DNA double helix (bottom). c, Schematic DNA-PAINT blinking time trace of a single
imaging target. The three blinking characteristics are the blinking on-time τon (1), the total imaging time τimage (2) and the blinking off-time (3); they
can be tuned to meet the three blinking requirements in e. d, Schematics of different substructures from the complex in a: a single target (left), a pair of
targets in close proximity (middle) and a dense lattice (right). All need different blinking requirements in e to be clearly visualized. e, Technical requirements
for achieving discrete molecular imaging. Each panel outlines one technical requirement and schematically depicts the effect on imaging quality before (left
column) and after (right column) the requirement is satisfied. For requirement (1), the intensity profile in 1D (top), fitted Gaussian centres in 1D (middle) and
2D (bottom) are shown. For requirements (2), (3) and (*), the localization time trace in 1D (top), localization histogram in 1D (middle) and 2D (bottom) are
shown. The orange lines and crosses indicate localizations. The orange bars depict localization histograms. The solid red lines and dotted grey lines indicate
successful and failed Gaussian fittings on localization histograms, respectively. For (3), grey crosses indicate true localizations eclipsed by false
double-blinking localizations. The same numbering for technical requirements (1)-(3) is also used in Fig. 2, Table 1 and Supplementary Fig. 3. f,g, Simulations of the
imaging effects of the technical requirements for the complex in a under increasingly improved imaging conditions without stage drift (f) or under non-ideal
imaging conditions with one of the four requirements unsatisfied (g). See Methods and Supplementary Methods 2 for simulation details, and Supplementary
Note 7 for discussions. Scale bars, 5 nm (e); 10 nm (f,g).

Figure 2 | Systematic characterization of the blinking requirements and optimization of the DNA-PAINT imaging quality. a, Methods for the systematic
characterization of the three blinking requirements depicted in Fig. 1. (1) DAFL measures the distance (At) between pairs of spatially close localizations
originating from adjacent camera frames. (2) Target SNR measures the separability of the peaks in the localization histogram in the super-resolved image.
S, signal; N, noise. The red curve indicates the two-peak Gaussian fit. (3) The photon count cut-off in the blinking trace measures the fraction of false
localizations. The blue shaded area indicates identified false localizations. The orange markers, bars and curves indicate localizations, histograms and time
traces, respectively. b-d, Designed DNA origami standards with 10 nm spacing under different blinking conditions. The left column shows the design
schematics of the DNA origami standards; green dots indicate DNA-PAINT docking strands; four corners in b,c are used as alignment markers. The right
columns show DNA-PAINT images under increasingly better blinking conditions (one condition per column). The histograms below the images show the
projection profiles from the areas indicated by white boxes along the directions of arrows. e, Quantitative characterization and pairwise comparisons of the
imaging conditions used in b-d before and after meeting each extra requirement, assayed with the methods in a. For each comparison, the left y axis (blue)
shows the control parameter and right y axis (green) shows the experimental measurement. For more details see Supplementary Figs 4-6 on origami
designs, Supplementary Figs 7-14 for super-resolution images, Methods and Supplementary Methods 3 and 5 for DNA-PAINT imaging conditions and
analysis methods. Scale bars, 10 nm in schematics and 20 nm in super-resolution images.


We examined the technical requirements for DMI with an
example square lattice pattern (Fig. 1d). It is increasingly
difficult to discretely identify and precisely position a single isolated
target (Fig. 1d, left), a pair of targets in close proximity (middle) and
finally a dense lattice of targets (right), thus requiring increasingly
stringent imaging conditions (Fig. 1e). These four conditions are
described as follows. Requirement (1): A high localization precision.
This can be obtained by collecting a high photon count per
single-molecule localization, and allows the precise localization of an
isolated target. Requirement (2): A high target signal-to-noise ratio
(target SNR) in the super-resolved image. This can be achieved by
collecting a large number of blinking events per molecular target, and
allows for a clear separation between two nearby targets. Requirement
(3): A low fraction of false localizations from double-blinking events.
This can be achieved by using a low blinking on-off duty cycle,
which minimizes falsely localized background noise and hence
allows the discrete visualization of each target within a densely
packed complex. Lastly, requirement (*): An accurate compensation
mechanism for microscope stage drift. This is important for the
accurate identification and localization of any molecular structures
of interest, especially over extended imaging times.
Computer-simulated super-resolution images reveal increasingly better image
qualities when more of these requirements are satisfied (Fig. 1f)
and that each of the four requirements is indispensable for DMI
(Fig. 1g). See Methods, Supplementary Methods 2 and
Supplementary Note 7.2 for details.

We termed requirements (1)-(3) collectively the three blinking
requirements, as they can all be met by appropriate single-molecule
blinking properties (see Fig. 1c). As DNA-PAINT allows flexible
tuning of target blinking kinetics, it provides a promising
route for implementing DMI.
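
To illustrate why requirement (3) ties the blinking duty cycle to target density, the sketch below uses a deliberately simplified model that is an assumption of this example, not the estimator used in the paper: n targets inside one diffraction-limited area blink independently, each being "on" a fraction d of the time. Given that one target is on, the chance that at least one other is also on is 1 - (1 - d)^(n - 1). The function names and the example numbers are hypothetical.

```python
def double_blink_fraction(duty_cycle, n_targets):
    """Probability that another target is on while a given target is on.

    Assumes independent blinking of n_targets within one diffraction-limited
    area, each with the same on-fraction (duty cycle).
    """
    return 1.0 - (1.0 - duty_cycle) ** (n_targets - 1)


def max_duty_cycle(max_fraction, n_targets):
    """Largest duty cycle that keeps the double-blink fraction at max_fraction."""
    return 1.0 - (1.0 - max_fraction) ** (1.0 / (n_targets - 1))


for d in (0.01, 0.002):
    frac = double_blink_fraction(d, 24)
    print(f"duty cycle {d:.3f}: double-blink fraction {frac:.3f}")

print(f"duty cycle for <=5% with 24 targets: {max_duty_cycle(0.05, 24):.4f}")
```

Under this model, the denser the cluster, the lower the duty cycle needed to keep the same fraction of overlapping blinking events.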

Systematic  characterization  and  quality  control  for  DMI 

To quantitatively characterize the effects of the above DMI
requirements on imaging quality, we proposed a set of assay methods
(Fig. 2a): (1) an image-based assay of the localization precision
and the maximal achievable resolution by comparing the positions
of super-localized centres from neighbouring frames (termed the
distance between adjacent-frame localizations, or DAFL); (2) a
target SNR assay based on the analysis of the distribution of
super-localized centres, which directly measures the separability of
neighbouring targets in super-resolved images; and (3) a
localization time trace-based assay for estimating the false localization
ratio. These assay methods provide a general, sample-agnostic
method for the stringent quality control of general super-resolution
microscopy studies as well as DMI (see Methods, Supplementary
Fig. 3 and Supplementary Methods 3 for details).
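
The idea behind DAFL can be sketched as follows. The example assumes isotropic Gaussian localization errors with per-axis standard deviation sigma, so that the squared distance between two localizations of the same target has expectation 4 sigma^2; it then recovers sigma from the mean squared adjacent-frame distance. The published assay may fit the distance distribution instead, and the function names and the `max_dist` cut-off are assumptions of this sketch.

```python
import math


def dafl_distances(locs, max_dist):
    """Distances between localizations in adjacent frames closer than max_dist.

    locs: list of (frame, x, y) tuples.
    """
    by_frame = {}
    for f, x, y in locs:
        by_frame.setdefault(f, []).append((x, y))
    dists = []
    for f, points in by_frame.items():
        for x1, y1 in points:
            for x2, y2 in by_frame.get(f + 1, []):
                d = math.hypot(x2 - x1, y2 - y1)
                if d < max_dist:  # keep pairs likely from the same emitter
                    dists.append(d)
    return dists


def precision_from_dafl(dists):
    """Per-axis localization precision, using E[d^2] = 4 * sigma^2."""
    mean_sq = sum(d * d for d in dists) / len(dists)
    return math.sqrt(mean_sq) / 2.0
```

Because it only compares localizations from neighbouring frames, the estimate is largely insensitive to slow stage drift and needs no prior knowledge of the sample.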

Using these assay methods, we first simulated super-resolution
movies with varying photon counts, numbers of blinking events
and blinking duty cycles, and measured the resultant
single-molecule localization precision, the target SNR and the fraction of
false localizations from double-blinking events (see Methods,
Supplementary Fig. 3 and Supplementary Methods 2 and 3 for
details). We observed a high localization precision (<1 nm,
supporting <2 nm FWHM resolution) with a high photon count (>30,000),
a consistent target separation with a high target SNR (>2 under our
definition) and low background noise under a low fraction of false
localizations (down to <5%), allowing DMI.

Applying the same assay methods, the three blinking
requirements (localization precision, target SNR and false localization
ratio) for DMI were each experimentally verified by subjecting
synthetic DNA origami nanostructure37,38 standards with three designed
target patterns to different DNA-PAINT imaging conditions
(Fig. 2b-d: the leftmost columns are the designed pattern schematics;
the five right columns are DNA-PAINT images, see Methods and
Supplementary Methods 4 for details). Synthetic DNA
nanostructures provide a programmable and geometrically precise molecular
patterning platform for single-molecule and super-resolution
studies. These structures were self-assembled from a long
single-stranded DNA scaffold and a collection of short staple strands37.
By extending a selected subset of these staple strands, DNA-PAINT
docking strands could be arranged into user-prescribed, geometrically
precise nanopatterns and used as super-resolution imaging standards
or auxiliary markers (see the next section, Supplementary Figs 4-6
and Supplementary Note 8 for details).

In Fig. 2b, two 10-nm spaced lines (each consisting of 5 points,
spaced 5 nm apart to satisfy the Nyquist criterion) only turned
from unresolvable speckles (left image) to separable lines (right
image) with an increased photon count per single-molecule
localization and hence a higher localization precision. However, the high
localization precision alone under this imaging condition failed to
resolve two points spaced by 10 nm (Fig. 2c, left image) due to
the reduced number of targets (from 5 to 1 on each side). The
two points only became resolvable (Fig. 2c, right image) with a
larger number of blinking events per target and hence an increased
target SNR. However, this imaging condition (with a high
localization precision and target SNR) still failed to resolve a 24-target
10-nm grid (Fig. 2d, left image) due to the increased target
density. The grid points only became individually resolvable
(Fig. 2d, right image) with a decreased blinking duty cycle and
hence a lower false localization ratio. Quantitative pairwise
comparisons of these imaging conditions and the imaging quality assay
results before and after meeting each of the three blinking
requirements are shown in Fig. 2e (see Methods, Supplementary Figs 7-12
and Supplementary Methods 5.1 and 6.1 for details). Finally, we
imaged these structures under the best imaging conditions and
obtained clear images of the designed patterns (Fig. 2b-d, rightmost
column, see Supplementary Figs 13 and 14 for details).

Subnanometre  accuracy  software-based  drift  correction 

DMI also imposes stringent requirements on microscope stage drift
compensation. For example, the clear separation of targets with
5 nm target-to-target spacing requires an accurate drift correction
of <1 nm (r.m.s. drift) due to the compounded effects of stage
drift and finite localization precision (assuming 1 nm localization
precision, Fig. 3a; see Methods and Supplementary Methods 2 for
details). Previous solutions either used an active feedback system
to reach <1 nm residual drift, which requires a complicated
hardware set-up and is technically involved to implement-1, or used
software-based post-processing methods with embedded nanoparticle
fiducial markers, which are typically limited by imperfect
tracking accuracy5-7,22 and do not support DMI resolution (~5 nm).
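
The <1 nm requirement can be checked with a back-of-the-envelope estimate. The sketch below assumes (our simplification, not a statement from the text) that residual drift and localization error are independent Gaussian contributions that add in quadrature, and that two targets are separable when their spacing exceeds the FWHM of the blurred spot. The function name effective_fwhm is illustrative only.

```python
import math

# FWHM of a Gaussian is 2*sqrt(2*ln 2) ~ 2.355 times its standard deviation.
FWHM_FACTOR = 2.0 * math.sqrt(2.0 * math.log(2.0))

def effective_fwhm(loc_precision_nm: float, drift_rms_nm: float) -> float:
    """FWHM of a spot blurred by localization error and residual drift.

    Assumes both contributions are independent and Gaussian, so their
    standard deviations add in quadrature (an illustrative assumption).
    """
    sigma = math.hypot(loc_precision_nm, drift_rms_nm)
    return FWHM_FACTOR * sigma

if __name__ == "__main__":
    spacing_nm = 5.0
    for drift in (0.5, 1.0, 2.0):
        fwhm = effective_fwhm(1.0, drift)
        verdict = "resolvable" if fwhm < spacing_nm else "not resolvable"
        print(f"drift {drift:.1f} nm r.m.s. -> FWHM {fwhm:.2f} nm ({verdict})")
```

Under these assumptions, 1 nm localization precision combined with 1 nm r.m.s. drift gives a spot FWHM of about 3.3 nm, below the 5 nm spacing, whereas 2 nm of drift already pushes the FWHM above 5 nm.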

We demonstrate a novel, synthetic nanostructure fiducial marker-based
stage drift correction and noise compensation method, termed
templated drift correction, which achieves high-accuracy
drift and noise cancellation (<1 nm r.m.s. residual drift) over both
short and long timescales (from subsecond to ~5 h), without
using specialized hardware. The key principle is to use
super-resolved single-molecule targets (such as a single DNA docking
strand) on pre-designed, geometrically precise nanopatterns (such
as a DNA nanostructure) as drift markers (Fig. 3b). Unlike
conventional fiducial markers, these single-target drift markers possess
unique advantages as they can be stably anchored on the surface,
localized with high precision and without bleaching, flexibly
rotatable around their anchor points (thus avoiding the fixed-dipole
effect) and free of offset from the fluorescence centre to the drift
marker centre position, such as in micrometre-sized beads and
previous DNA origami drift markers1031. We arranged these
single-target markers in pre-designed, well-separated nanopatterns
(the templates) to help pack a number of single-target markers
within a diffraction-limited area and to allow their identification
and separation during software processing steps (Fig. 3b).
Moreover, the precise geometry of a nanopattern correlates all of
the single-target drift markers on this pattern and effectively
increases their on-fraction (the fraction of time that the marker is
bright), producing a more accurate drift correction. We term this
method geometry-templated drift correction.

We designed a square lattice pattern with 20 nm spacing as our
nanopattern template to implement this strategy (Fig. 3c): a grid
consists of 12 targets (docking strands), each of which will be treated as a
single-target drift marker. After performing DNA-PAINT imaging
and applying a round of simple trace-averaging drift correction, all
12 targets are clearly separated from each other in the reconstructed
image, allowing them to serve as single-target drift markers (Fig. 3d,
see also Supplementary Fig. 15). Note that the missing grid points
were probably due to defects in the origami self-assembly or DNA
synthesis (see Supplementary Note 9.1 for discussion), rather than
imaging incompleteness. Single-particle analysis confirmed the
completeness of the nanopattern and showed a high degree of regularity of
the grid geometry as designed (Fig. 3e, see Methods, Supplementary
Fig. 16 and Supplementary Methods 6.2 for details). Specifically, we
performed two-dimensional (2D) Gaussian fitting on each grid
point followed by a regular grid fit to these 12 Gaussian-fitted
centres, and observed that the average deviation between the
Gaussian-fitted and regular grid-fitted centres was well below 1 nm
(<0.30 nm r.m.s., Fig. 3f; see Methods, Supplementary Methods 6.2
and Supplementary Note 9.2 for details).
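
The regular grid fit and the r.m.s. deviation described above can be sketched as follows. This is a minimal illustration, not the authors' analysis code: it assumes the correspondence between measured centres and lattice points is already known and that the lattice spacing is fixed at the design value, so only a rotation and a translation are fitted (a standard Kabsch/Procrustes least-squares fit). The function name fit_regular_grid and the synthetic data are hypothetical.

```python
import numpy as np

def fit_regular_grid(template, centres):
    """Least-squares rigid fit (rotation + translation) of an ideal lattice.

    template: (N, 2) ideal lattice coordinates in nm.
    centres:  (N, 2) Gaussian-fitted target centres, in the same order.
    Returns the fitted lattice positions and the 2D r.m.s. deviation.
    """
    t0 = template.mean(axis=0)
    c0 = centres.mean(axis=0)
    # Kabsch algorithm: SVD of the cross-covariance of the centred point sets.
    h = (template - t0).T @ (centres - c0)
    u, _, vt = np.linalg.svd(h)
    d = np.sign(np.linalg.det(vt.T @ u.T))  # guard against reflections
    rot = vt.T @ np.diag([1.0, d]) @ u.T
    fitted = (template - t0) @ rot.T + c0
    rms_2d = np.sqrt(np.mean(np.sum((centres - fitted) ** 2, axis=1)))
    return fitted, rms_2d

# Synthetic 3 x 4 lattice with 20 nm spacing, rotated, shifted and jittered.
gx, gy = np.meshgrid(np.arange(4) * 20.0, np.arange(3) * 20.0)
template = np.column_stack([gx.ravel(), gy.ravel()])
theta = np.deg2rad(10.0)
rot_true = np.array([[np.cos(theta), -np.sin(theta)],
                     [np.sin(theta), np.cos(theta)]])
rng = np.random.default_rng(0)
ideal = template @ rot_true.T + np.array([35.0, -12.0])
centres = ideal + rng.normal(0.0, 0.3, size=template.shape)

fitted, rms_2d = fit_regular_grid(template, centres)
print(f"2D r.m.s. deviation: {rms_2d:.2f} nm")
```

A fit that also frees the lattice spacing or shear would need extra parameters; the rigid version is used here only because it is the simplest instance of the idea.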

We then tested the performance of templated drift correction on
the grid structures using the analysis workflow in Fig. 3i (left arrow,
see Methods and Supplementary Methods 5.2 for details). Briefly,
we first identified a pool of separable single-target drift markers,
extracted their blinking time traces and determined the expected
centre position for each target. Then for every frame we collected
all of the localizations originating from these drift markers and
calculated a global, photon-weighted average of 'offset vectors' from
each localization to its expected centre position and used it as the
drift correction vector. Single-particle analysis after templated
drift correction showed a sharper image of the grid (Fig. 3g).
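
The per-frame calculation described above can be sketched as follows. This is a minimal illustration under stated assumptions rather than the authors' implementation: localizations are assumed to be already assigned to drift markers, the expected centre of each marker is assumed known, and the function and variable names (drift_correction_vectors, frame, marker, photons) are hypothetical.

```python
import numpy as np

def drift_correction_vectors(frame, xy, photons, marker, centres, n_frames):
    """Photon-weighted mean offset vector for every frame.

    frame:   (N,) integer frame index of each localization.
    xy:      (N, 2) localization coordinates in nm.
    photons: (N,) photon count of each localization, used as the weight.
    marker:  (N,) index of the drift marker each localization belongs to.
    centres: (M, 2) expected centre position of each drift marker.
    Frames without any marker localization get NaN.
    """
    # Offset vector from each localization to its expected centre position.
    offsets = centres[marker] - xy
    weight_sum = np.bincount(frame, weights=photons, minlength=n_frames)
    corr = np.full((n_frames, 2), np.nan)
    for axis in range(2):
        weighted = np.bincount(frame, weights=photons * offsets[:, axis],
                               minlength=n_frames)
        with np.errstate(invalid="ignore", divide="ignore"):
            corr[:, axis] = weighted / weight_sum
    return corr

# Toy example: three markers, three frames, a known drift and no noise.
centres = np.array([[0.0, 0.0], [20.0, 0.0], [0.0, 20.0]])
drift = np.array([[0.0, 0.0], [1.0, -0.5], [2.0, -1.0]])
frame = np.array([0, 0, 1, 2, 2])
marker = np.array([0, 1, 2, 0, 1])
photons = np.array([1000.0, 3000.0, 2000.0, 500.0, 1500.0])
xy = centres[marker] + drift[frame]

corr = drift_correction_vectors(frame, xy, photons, marker, centres, n_frames=3)
corrected = xy + corr[frame]  # apply the per-frame correction vector
```

Because the average is taken over all markers in the field of view, the marker index can run over targets on many origami grids at once; frames with no bright marker would have to be interpolated, which this sketch leaves out.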
Figure 3 | Principle and performance of DNA nanostructure templated drift correction. a, The effect of drift on imaging quality is simulated for the
biomolecular complex in Fig. 1a with 1 nm localization precision and different levels of stage drift. The structure can only be clearly visualized with 1 nm
(r.m.s.) of drift or less. b, The principle of the templated drift correction method with predesigned nanostructure patterns. Illustrating a three-target marker
example, the schematics show a nanopattern design with single-target markers (left), the localization time traces from individual single-target markers
(middle) and averaged drift correction trace after combining the traces from many markers (right). The targets and traces are colour-matched. Dashed
arrows indicate many more structures used for drift correction. c, Design schematic of a 3 × 4 square grid with 20 nm point-to-point spacing on a DNA
origami nanostructure. Each green dot indicates a docking strand. d, Representative DNA-PAINT super-resolution images of the 20 nm grid structure in c
imaged with 300 ms frame time, 30,000 total frames and 3 nM imager strands. Missing grid points were probably due to synthesis or incorporation defects
(see Supplementary Note 9.1 for more discussion). e, Single-particle averages of 20 nm grid images (n = 700) after trace averaging. The overlaid crosses
indicate Gaussian fitted centres and regular grid-fitted centres using the red crosses as fitting targets. f, Root-mean-square (r.m.s.) deviation between the
Gaussian fitted and regular grid-fitted centres in e in X, Y projections and 2D distance. g,h, Single-particle averages of the 20 nm grid images (n = 700) after
templated (g) and geometry-templated (h) drift correction. The overlaid crosses indicate Gaussian fitted and regular grid-fitted centres as in e. The same
colour code for different stages of drift correction in e, g and h is also used in i, j and Fig. 4. i, Procedure for templated and geometry-templated drift
correction with 20 nm grid structures as templates. The schematics show a large field-of-view image with many drift markers after simple trace averaging
(left). Each grey circle indicates a 20 nm drift marker. Three zoomed-in (squares) schematics show a super-resolved 20 nm grid marker after simple trace
averaging (left), after templated drift correction (middle) and after geometry-templated correction (right). Dashed arrows indicate many more structures
used for drift correction. Further zoomed-in schematics (circles) show one single-target marker and the calculation of the offset vectors. In the zoomed-in
schematics (both squares and circles) grey dots indicate localizations, green dots and lines indicate Gaussian-fitted centres and regular grid-fitted lattices as
guides for templated and geometry-templated drift correction calculations, and the red lines represent the calculated offset vectors. j, Comparison of the
allowable imaging resolution (measured in FWHM, orange) and estimated remaining drift (turquoise) at different stages of drift correction. For more details,
see Methods and Supplementary Methods 2 and 5 for the simulation and analysis methods, and Supplementary Fig. 15 for the super-resolution images.
Scale bars, 10 nm (a); 20 nm (d,e,g,h and zoomed-in images in i).


Furthermore, the superior regularity of these 20 nm grid structures
allowed us to perform another round of geometry-templated drift
correction, again using these structures as drift markers (Fig. 3i,
second arrow; see Methods and Supplementary Methods 5.2 for
details), and producing an even sharper single-particle averaged
image (Fig. 3h). We also compared the imaging resolution before
and after templated and geometry-templated drift corrections,
respectively, and estimated <1 nm (r.m.s.) residual drift after
templated and geometry-templated corrections (Fig. 3j; see Methods
and Supplementary Methods 6.2 for details).
Figure 4 | Systematic quality analysis of 5 nm grid super-resolution image. a, Design schematics of a 4 × 6 triangular grid structure with ~5 nm
point-to-point spacing on a DNA origami nanostructure. Each green dot indicates a docking strand. b, Allowable imaging resolution assayed by two methods before
drift correction: single-molecule fitting uncertainty and DAFL, both measured in FWHM. c, Comparison of the DNA-PAINT images of a 5 nm grid structure
and a 20 nm grid drift marker (blue, inset) at different stages of drift correction. d,e, Measured imaging resolution assayed by two methods after drift
correction. d, Target localization spread (TLS), where the point cloud shows the overlapped localizations from individually separable targets aligned by the
centre of mass. Histograms are shown for the horizontal (left) and vertical (top) projections. The red curves indicate Gaussian fits. e, Fourier ring correlation
(FRC). The correlation curves (blue solid lines) and noise-based cutoff (red dotted lines) are shown for 10 representative images; black dots indicate crossing
points. f, Comparison of the measured imaging resolution at different stages of drift correction assayed by TLS and FRC. The red dashed line indicates the
localization precision-limited best allowable resolution (as determined by DAFL). The DNA-PAINT imaging conditions used for this experiment were 400 ms
frame time, 40,000 total frames and 1 nM imager strand concentration. See Methods, Supplementary Figs 17-21 and Supplementary Methods 3 and 5 for
more details on the assay methods and results. Scale bars, 10 nm in the images in c, 20 nm in the insets in c and 2 nm in d.


Note that during templated and geometry-templated drift
correction processes the global average of all of the offset vectors
calculated from many origami grids across the entire field of view (rather
than those from a single origami grid) was used for drift correction.
In addition, although here we treat the 20 nm grid as both drift
markers and imaging samples, in general applications (such as the
5 nm grid image in the next section) no previous knowledge of
the imaging sample is required; only that of the origami grid
markers is used.

5  nm  grid  DMI  and  analysis 

To demonstrate the imaging capability of DMI, we designed a
triangular grid structure with ~5 nm point-to-point spacing (Fig. 4a).
This is the densest clustering pattern possible on our origami
breadboard37 and it also mimics the monomer spacing and arrangement
in a microtubule segment (with a ~5 × 4 nm monomer size)3*.

Compared with the 10 nm grid images (Fig. 2b-d), the higher
target density in this sample imposes more stringent imaging
conditions: namely, an even higher photon count, a larger number
of blinking events per target and a lower blinking duty cycle. We
used a short (7 nt) docking strand to accommodate the high
target density and to avoid potential spatial cross-talk between
neighbouring targets, then carefully adjusted the imaging conditions
to satisfy all three blinking requirements (Table 1) with a high
single-molecule localization precision (1.6 nm by DAFL, <1.0 nm
from single-molecule fitting, 1.5 nm by theoretical estimate15,
Fig. 4b). See Supplementary Figs 17-21 for details on the imaging
quality characterization.

Table 1 | Critical imaging quality parameters for the
three blinking requirements.

Parameter                               Value
Photon count (×1,000)                   50 ± 14
Localization precision (nm)             1.6 (0.9)
Blinking events per target              77 ± 15
Target signal-to-noise ratio            2.3 ± 1.0
Blinking duty cycle per target (%)      0.3
Fraction of false localizations (%)     8

The localization precision value in brackets was measured by the single-molecule fitting uncertainty.

We used the 20 nm grid structures as drift markers. After
applying each step of the drift correction procedure (Fig. 3i), both the
20 nm grid drift markers (Fig. 4c, insets) and 5 nm grid samples
(Fig. 4c) became sharper and more regular. We employed two
methods for assaying the integral imaging quality: target localization
spread (or TLS, carried out by overlaying the localization clouds
from well-separated targets and measuring the spread of the overlaid
cloud with 2D Gaussian fitting; see Fig. 4d) and Fourier ring
correlation (FRC, carried out by computing the correlation
between the 2D Fourier transform spectra of independent half
images, Fig. 4e). A comparison of the maximal allowable resolution
Figure 5 | Discrete molecular imaging of 5 nm grid structure. a, Representative DMI image of a 5 nm triangular grid structure obtained with DNA-PAINT.
Inset: Design schematics, where each green dot indicates a docking strand. Arrows indicate the projection directions and areas of study for b-e. b, Intensity
projection profiles from the image in a along the directions indicated by the colour-matched arrows. The profiles are aligned by the central peaks indicated
by red arrows. c, Cropped-out image from a showing the central region (grey rectangle) and central pixel line (the magenta line and arrows) used for the
analysis in d and e. d, Intensity profile along the central line (magenta) and projection from the central region (grey), as indicated by the colour-matched
regions in c. The four-peak Gaussian fit is shown by the dashed black lines. Numbers indicate the fitted centre positions (top) and the standard deviation
values (bottom) for each peak, with an average of 1.7 nm, supporting a 4.0 nm FWHM resolution. e, Auto-correlation analysis from the colour-matched
profiles in d, showing a consistent periodicity of 5.7 nm. f, Automatic multitarget fit of the 5 nm grid image in a. The crosses indicate Gaussian-fitted centres
(green) and regular grid-fitted centres using the green crosses as targets (blue). Inset: r.m.s. deviation between the green and blue crosses (<0.5 nm in 1D
and <0.7 nm in 2D). g, Representative images of the 5 nm grid structures, showing structural regularity and heterogeneity. For each structure the left panel
shows the super-resolution rendered image and the right panel shows the automatic fitted image. h, Single-particle class average of the 5 nm grid (n = 25).
The green dashed line and arrow indicate the axis and operation of symmetry of the structure. i, Uniformity of the blinking kinetics as represented on a 5 nm
degenerate grid. The colour maps show averages (left) and coefficients of variation (right) of the numbers of blinking events for each distinguishable target.
j, Automatic multitarget fit (grey) and two-component grid fit of the 5 nm image in a, allowing an offset between two groups of targets with opposite staple
strand orientations, coloured in green and blue respectively. k, Offsets between the two groups of staples in j measured from single-molecule images. The
error bars indicate standard deviation (n = 10). It is important to note that no previous knowledge of the sample structure (the 5 nm grid) was used to
produce the above results. The DNA-PAINT imaging conditions used for this experiment were 400 ms frame time, 40,000 total frames and 1 nM imager
strand concentration. See Supplementary Figs 17, 22, 23 and Supplementary Methods 5 and 6 for super-resolution images and analysis details, and
Supplementary Note 9 for discussions. Scale bars, 10 nm in all panels.


is shown in Fig. 4f: 4.3 nm (by TLS) and 3.7 nm (by FRC) after
geometry-templated drift correction, which allowed us to estimate
the residual drift to be <1 nm (r.m.s.). See Methods for details on
image quality characterization.
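
The FRC assay mentioned above correlates the Fourier spectra of two independent half images ring by ring. The sketch below shows the core calculation only; it assumes square, equally sized rendered images and leaves out the noise-based cutoff used to read off a resolution value. The function name and the random test images are illustrative, not taken from the authors' software.

```python
import numpy as np

def fourier_ring_correlation(img1, img2):
    """FRC curve between two square images of identical shape.

    Returns one correlation value per integer-radius ring in Fourier
    space, from the zero-frequency ring up to (but not including) Nyquist.
    """
    f1 = np.fft.fftshift(np.fft.fft2(img1))
    f2 = np.fft.fftshift(np.fft.fft2(img2))
    n = img1.shape[0]
    yy, xx = np.indices(img1.shape)
    ring = np.hypot(yy - n // 2, xx - n // 2).astype(int)  # ring index per pixel
    n_rings = n // 2
    mask = ring < n_rings
    idx = ring[mask]
    # Sum the cross-spectrum and both power spectra over each ring.
    cross = np.bincount(idx, weights=(f1 * np.conj(f2)).real[mask], minlength=n_rings)
    power1 = np.bincount(idx, weights=(np.abs(f1) ** 2)[mask], minlength=n_rings)
    power2 = np.bincount(idx, weights=(np.abs(f2) ** 2)[mask], minlength=n_rings)
    return cross / np.sqrt(power1 * power2)

rng = np.random.default_rng(1)
image = rng.standard_normal((64, 64))
frc_same = fourier_ring_correlation(image, image)  # identical halves
frc_noise = fourier_ring_correlation(rng.standard_normal((64, 64)),
                                     rng.standard_normal((64, 64)))  # unrelated halves
```

In practice the two inputs would be images rendered from disjoint halves of the localization list, and the resolution is taken as the inverse of the spatial frequency at which the curve drops below the chosen cutoff.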

Plotting the projection profiles along the three symmetry axes of
the triangular grid structure further confirmed the DMI imaging
quality and the structural regularity of the nanostructures on a
representative single-molecule image of a 5 nm grid (Fig. 5a,b).
Variations in the peak intensities resulted from missing grid points,
which we note again were probably due to nanostructure synthesis
defects (see Supplementary Note 9.1 for more details). A projection
histogram of the four targets in a central column and the
autocorrelation analysis showed an averaged 1.7 nm target standard
deviation (equivalent to 4.0 nm resolution in FWHM; Fig. 5c,d) and
Figure 6 | Discrete molecular imaging with complex patterns and multiplexed visualization. a-d, DMI of a five-character pattern ('Wyss!') on a DNA
origami nanodisplay board with 5 nm pixel size. a, Design schematics. Each dot indicates a staple strand. Green dots were extended with DNA-PAINT
docking strands. b, Single-particle class average of the Wyss! pattern (n = 85). c, A representative single-molecule image of the Wyss! pattern under DMI.
d, Overlay of the design schematics on top of the automatically fitted single-molecule image in c. e-h, Three-colour multiplexed DMI; each colour indicates
a separate imaging channel with an orthogonal DNA-PAINT sequence. e, Design schematic of a three-colour dual-purpose drift and alignment marker.
f, Cross-channel alignment. Three single-channel images (left three columns) and one composite image (right column) are shown for two example
alignment markers. g, Design schematics of a three-colour 5 nm grid structure. h, Representative multiplexed DMI image of three-colour 5 nm grid
pattern in g. DNA-PAINT super-resolution images (top row) and automatically fitted images (bottom row) are shown for all three single-colour channels
(left three columns) and the combined image (right column) for two representative 5 nm grid structures. The DNA-PAINT imaging conditions used in
these experiments are as follows. 'Wyss!' letter pattern image: 500 ms frame time, 100,000 total frames and 0.4 nM imager strand concentration.
Multicolour pattern image: 400 ms frame time, 2-3 nM imager strand concentration and 20,000 total frames for each colour channel. See Supplementary
Figs 24 and 25 for more super-resolution images, and Methods and Supplementary Methods 5 and 6 for the image analysis methods. Scale bars, 10 nm (Mb);
20 nm (f).


5.7 nm intertarget spacing, consistent with the design (Fig. 5e). We
next performed automatic target detection and 2D Gaussian fitting
to each target, followed by regular grid fitting to the Gaussian-fitted
centres (Fig. 5f, see Methods and Supplementary Methods 6.2 for
details). The average deviation between the Gaussian-fitted and
grid-fitted centres was measured to be below 1 nm (with a
single-target localization precision of 0.18 nm; Fig. 5f).

Each imaging session produced ~50-100 single-molecule
images of well-resolved 5 nm grids (Fig. 5g). We next performed
single-particle averaging on them and studied the single-molecule
imaging heterogeneity among them on the degenerate (half) grid
(due to the 180° rotational symmetry; see Fig. 5h, Supplementary
Fig. 16 and Supplementary Note 9.2). Specifically, we computed
the coefficient of variation in the number of blinking events
across different targets in the degenerate grid (0.07) and across
different single-molecule images at the same target (&2Q). These
results demonstrated a relatively uniform imaging efficiency
(Fig. 5i, see Supplementary Fig. 22 and Supplementary Methods
6.2 for details), which further supports the hypothesis that the
missing grid points probably resulted from strand synthesis or
incorporation defects rather than non-uniform imaging efficiency
(see Supplementary Note 9.1 for details).
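
The two coefficients of variation quoted above can be computed from a table of blinking-event counts. The sketch below is illustrative only: the array layout (one row per single-molecule image, one column per target of the degenerate grid), the use of the sample standard deviation and the numbers themselves are assumptions, not values from the experiment.

```python
import numpy as np

def blinking_cv(counts):
    """Coefficients of variation of blinking-event counts.

    counts: (n_images, n_targets) number of blinking events observed at
            each target in each single-molecule image.
    Returns (cv_across_targets, cv_per_target):
      cv_across_targets - spread of the per-target means between targets;
      cv_per_target     - spread between images, one value per target.
    """
    counts = np.asarray(counts, dtype=float)
    mean_per_target = counts.mean(axis=0)
    cv_across_targets = mean_per_target.std(ddof=1) / mean_per_target.mean()
    cv_per_target = counts.std(axis=0, ddof=1) / mean_per_target
    return cv_across_targets, cv_per_target

# Hypothetical counts for two images and two targets.
example_counts = [[70, 80],
                  [90, 100]]
cv_targets, cv_images = blinking_cv(example_counts)
print(f"CV across targets: {cv_targets:.3f}")
print("CV across images, per target:", np.round(cv_images, 3))
```

A small CV across targets indicates that no grid position is systematically under-sampled, which is the sense in which the text describes the imaging efficiency as uniform.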

We analysed the origami staple wiring pattern on single-molecule
DMI images with angstrom-level precision (<2 Å, Fig. 5j,k). By
performing an automatic two-component grid fitting algorithm on
single-molecule images, we determined the structural offset between
two groups of staples with opposing orientations (0.6 ± 0.1 nm),
which is consistent with the prediction from structural analysis
(0.6 nm, see Supplementary Fig. 23 for details).

DMI in complex and multiplexed samples

We also demonstrated DMI in a complex arrangement by
constructing and visualizing a custom-designed letter pattern ('Wyss!') on the
60 × 85 nm origami nanodisplay breadboard with 5 nm display
pixel size (Fig. 6a and Supplementary Fig. 24). The single-particle
class average showed an average resolution of 4.6 nm in FWHM,
allowing the visualization of individual targets (Fig. 6b). A
representative single-molecule super-resolution image (Fig. 6c) displays
individually distinguishable targets arranged in the designed
pattern (Fig. 6d).

Multiplexed DMI further requires accurate registration between
multiple imaging channels. For example, to achieve 5 nm imaging
resolution, <1 nm registration accuracy is required. Traditional
multitarget imaging and colocalization studies on the molecular scale
rely on registration between multiple spectral channels. They either
require a specialized and complicated hardware set-up and
calibration process, and hence can be practically difficult to extend
beyond two-colour registration or to the entire field of view, or
achieve a suboptimal registration accuracy that does not allow
molecular resolution imaging21,40.

We combined DMI with Exchange-PAINT*0 and demonstrated
highly accurate (<1 nm) three-colour registration, in addition to
highly accurate drift correction (<1 nm r.m.s.) within each channel.
The method uses only one fluorophore and one optical path, which
greatly simplifies cross-channel registration and naturally extends to
the entire field of view. To implement this method, we first designed
a three-colour nanogrid dual-purpose drift and alignment marker,
where each colour comprises a regular square grid with 20 nm
lattice spacing, similar to the 20 nm grid used above (Fig. 6e). We
designed three orthogonal imager sequences with optimized binding
on-time, and labelled the substructure in each colour with a unique
sequence. We performed multiplexed DMI imaging through serial
buffer exchange (Exchange-PAINT), and then performed drift
correction using these dual-purpose markers for each channel
individually, followed by cross-channel alignment between each pair of
different colours (Fig. 6f; see Methods, Supplementary Fig. 25 and
Supplementary Methods 6.2 for details). The high-precision
visualization of individual targets (down to the angstrom level, as shown above)
allows highly accurate alignment across all channels (<1 nm), enabling
DMI imaging (Fig. 6f).
We then imaged a three-colour mixture structure of the 5 nm
grid with multiplexed DMI with an average DAFL localization
precision of 2.0 nm and a TLS resolution of 5.9 nm in FWHM
(Fig. 6g,h, Supplementary Fig. 25). Automatic target fitting to
single-molecule structures revealed the regular grid pattern
as designed.

Conclusion 

DMI differs from current super-resolution fluorescence microscopy
techniques in a similar manner to digital versus analogue signal
processing. Whereas current super-resolution demonstrations
typically focus on depicting the continuous morphology of
macromolecular structures and the biomolecule spatial distributions
(analogue)2 with the resolution limited by the sampling
density (the Nyquist sampling criterion), DMI imaging aims at
discrete visualization of each individual molecular component
within these structures. Combined with the high multiplexing
power of Exchange-PAINT, DMI could potentially enable the
determination of the position and identity of each molecular
component in a complex biological or synthetic nanoscale system,
and thus offers a complementary method to electron microscopy
and crystallography with single-molecule sensitivity. Furthermore,
the angstrom-level precision (<2 Å) structural study of DNA
nanostructures suggests that DMI could be applicable to studying
submolecular and even potentially atomic-level features (for example,
the macromolecular composition and component positions,
geometric distortions and conformational changes).

We see two challenges in the further development of DMI. The
first comes from the physical trade-off between spatial and temporal
resolution13-21. Specifically, achieving higher spatial resolutions
requires longer blinking on-times, a larger number of blinking
events and a lower blinking on-off duty cycle, all of which
necessarily increase the total image acquisition time. Potential ways to
shorten the imaging time include engineering brighter fluorophores
to shorten the blinking on-time required to collect enough photons.
The second challenge is imposed by the imperfect labelling of
biomolecular targets. Although our method is not limited by the
Nyquist sampling criterion (the number of probes labelled
per unit space), it is still limited by the absolute labelling efficiency
(the average number of probes labelled per molecular target).
Conventional immunostaining methods with IgG antibodies both
introduce a large offset from the target epitope to the chemically
conjugated probe (~10 nm) and often have limited labelling
efficiencies (due to the large size and sometimes the low affinity
of the antibodies). Several approaches could potentially address
these issues: genetically labelled tags (for example SNAP-tags,
unnatural amino acids) can provide a smaller probe size and
higher labelling efficiency; aptamers, small-molecule labels,
single-chain antibody fragments and camelid single-chain antibodies
(nanobodies) could also provide effective alternatives for smaller
affinity probes.

DMI allows the direct visualization of each individual
component in a densely packed biomolecular environment. In the
future, we expect the development of DMI to eventually allow
quantitative molecular features to be studied in diverse biological systems,
such as the molecular composition and architecture of diverse
cellular systems (for example, cell membrane receptor clusters, neuronal
synapses), the molecular states of individual protein components
within a macromolecular context (for example, the binding and
rotational states of ring ATPase complexes, patterns of histone
modifications) and the 3D architecture of chromosomes with high
spatial and genomic resolution.

Methods 

Methods  and  any  associated  references  are  available  in  the  online 
version  of  the  paper. 
Methods

Materials and buffers. Unmodified and biotin-labelled DNA oligonucleotides were
purchased from Integrated DNA Technologies. Fluorescently modified DNA
oligonucleotides were purchased from Biosynthesis. Streptavidin was purchased
from Invitrogen (catalog number S-888). Biotinylated bovine serum albumin
(BSA-biotin, catalog number A8549), Protocatechuate 3,4-Dioxygenase (PCD,
catalog number P8279), Protocatechuic acid (PCA, catalog number 37580) and
6-Hydroxy-2,5,7,8-tetramethylchromane-2-carboxylic acid (Trolox, catalog number
238813) were purchased from Sigma-Aldrich. Glass slides and coverslips were
purchased from VWR. The M13mp18 scaffold was purchased from New England
BioLabs. Freeze 'N Squeeze columns were purchased from Bio-Rad.

The following buffers were used for sample preparation and imaging: DNA
origami folding buffer (12.5 mM MgCl2, 1x TE buffer), buffer A (10 mM Tris-HCl,
100 mM NaCl, 0.1% (v/v) Tween 20, pH 8.0), buffer B (10 mM Tris-HCl, 10 mM
MgCl2, 1 mM EDTA, 0.1% (v/v) Tween 20, pH 8.0) and buffer TP (1x buffer B,
10 nM PCD, 2.5 mM PCA, 1 mM Trolox).

Fluorescence microscopy set-up. Fluorescence imaging was carried out on an
inverted Nikon Eclipse Ti microscope (Nikon Instruments) with the Perfect Focus
System, applying an objective-type TIRF configuration using a Nikon TIRF
illuminator with an oil-immersion objective (CFI Apo TIRF 100x, numerical
aperture (NA) 1.49). Laser excitation with a 561 nm laser (200 mW, Coherent
Sapphire) was passed through a clean-up filter (ZET561/10, Chroma Technology)
and coupled into the microscope using a beam splitter (ZT488rdc/ZT561rdc/
ZT640rdc, Chroma Technology). Fluorescence light was spectrally filtered with an
emission filter (ET600/50m, Chroma Technology). Super-resolution movies were
recorded with either an electron multiplying charge-coupled device (EMCCD, used
without EM gain option) camera (iXon X3 DU-897, Andor Technologies) or a
scientific complementary metal-oxide-semiconductor device (sCMOS) camera
(Zyla 4.2, Andor Technologies).

Simulation of the microscopy data set. Simulation of the microscopy data sets was
performed with custom-written MATLAB software for Figs 1, 3 and Supplementary
Fig. 3, with realistic parameters determined from fluorescence microscopy
experiments, including image and pixel sizes, camera conversion factor and noise
level, fluorophore photon emission rate and imaging background noise. Stochastic
and independent blinking kinetics was simulated for all images apart from the first
two blinking requirement tests in Supplementary Fig. 3. Intensity distributions from
single-molecule blinking events were generated with finite pixel approximation of
Gaussian profiles and corrupted with Poisson noise and Gaussian background and
readout noise. See Supplementary Methods 2 for more details.
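
The noise model described above can be illustrated with a minimal Python sketch. This is not the authors' MATLAB code: the function names, frame size, spot width, photon number, background level and read-noise value are all assumptions chosen for illustration, and Knuth's simple Poisson sampler is used only because it needs no external library.

```python
import math
import random


def pixel_integrated_gaussian(size, x0, y0, sigma, photons):
    """Expected photons per pixel for a 2D Gaussian spot.

    Finite-pixel approximation: each pixel holds the integral of the
    Gaussian over its unit area, computed with the error function.
    Coordinates and sigma are in pixel units (assumed values).
    """
    def edge_integral(lo, hi, centre):
        s = sigma * math.sqrt(2.0)
        return 0.5 * (math.erf((hi - centre) / s) - math.erf((lo - centre) / s))

    return [[photons * edge_integral(x, x + 1, x0) * edge_integral(y, y + 1, y0)
             for x in range(size)]
            for y in range(size)]


def poisson(lam, rng):
    """Draw one Poisson sample with Knuth's method (fine for modest lam)."""
    limit = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1


def simulate_frame(size=9, x0=4.3, y0=4.6, sigma=1.3, photons=2000,
                   background=5.0, read_noise=1.5, seed=0):
    """One camera frame: Poisson shot noise on signal plus background,
    then additive Gaussian readout noise."""
    rng = random.Random(seed)
    expected = pixel_integrated_gaussian(size, x0, y0, sigma, photons)
    return [[poisson(mu + background, rng) + rng.gauss(0.0, read_noise)
             for mu in row]
            for row in expected]
```

Calling `simulate_frame()` returns a 9 x 9 list of noisy pixel values for a single blinking event; a full movie would repeat this per frame with stochastic on and off switching.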

Imaging quality characterization for three blinking requirements. For blinking
requirement (1) the photon count was calculated by converting camera counts to
photons using the camera manufacturer-provided conversion factor. The localization
precision was characterized by two methods. DAFL was calculated for pairs of
localizations that originated from the same blinking events but were separated into
two adjacent frames; the distribution of all of the distances between the pairs was
fitted to the theoretical distribution function and the localization precision was
determined from the fit. The Gaussian fitting uncertainty reports the Cramér-Rao
lower bound (CRLB) for 2D Gaussian fitting for each localization44.
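
A hedged sketch of the DAFL idea follows. It assumes that each localization carries an independent, isotropic Gaussian error with per-axis standard deviation sigma, so that the distance between two localizations of the same emitter follows a Rayleigh distribution with scale sigma times the square root of 2. Instead of the histogram fit described above, the sketch uses the closed-form maximum-likelihood estimate for that distribution; the function name is hypothetical.

```python
import math


def dafl_precision(distances):
    """Estimate per-axis localization precision from adjacent-frame distances.

    Assumes Rayleigh-distributed distances with scale sqrt(2) * sigma,
    for which the expected squared distance is 4 * sigma**2.
    """
    mean_sq = sum(d * d for d in distances) / len(distances)
    return math.sqrt(mean_sq / 4.0)
```

With distances in nanometres the result is also in nanometres. A fit to the full distribution, as in the text, would additionally expose deviations from the assumed model.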

For blinking requirement (2) the number of blinking events was calculated for
each imaging target from the single-molecule blinking time trace by counting the
number of on-off switchings within the time trace. The target SNR was calculated
for each pair of neighbouring targets by either automatically or manually selecting a
region of interest that enclosed both targets and computing the localization
distribution along the axis connecting them; two-peak Gaussian fitting was
performed and the peak-to-valley distance and residual noise were used as signal
and noise.
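
Counting on-off switchings in a time trace can be sketched as below. The function name and the fixed intensity threshold are assumptions for illustration; the source does not state how the on state was detected.

```python
def count_blinking_events(trace, threshold):
    """Count off-to-on transitions in a per-frame intensity trace.

    A frame is treated as 'on' when its value exceeds the (assumed)
    threshold; consecutive 'on' frames belong to the same event.
    """
    events = 0
    was_on = False
    for value in trace:
        is_on = value > threshold
        if is_on and not was_on:
            events += 1
        was_on = is_on
    return events
```

For example, a trace such as `[0, 0, 5, 6, 0, 7, 0, 0, 8, 8, 8]` with a threshold of 2 contains three separate blinking events.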

For blinking requirement (3) the blinking duty cycle was calculated for each
structure from the structure's blinking time trace by calculating the characteristic
on-time and off-time respectively. These were obtained by fitting the cumulative
distribution of all on- and off-times both of and between blinking events to the
expected distribution functions. False localizations were determined from
abnormally high photon counts by a photon count threshold (2σ above the mean
photon count). The effective localizations for the simulations were determined by a
distance cutoff between the localized position and the simulated (true) positions
with a 3σ threshold.
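
The two quantities above can be sketched in Python under stated assumptions. The duty cycle is taken as the characteristic on-time divided by the sum of the characteristic on- and off-times; the characteristic times are approximated here by sample means (the maximum-likelihood estimate for exponentially distributed dwell times), which stands in for the cumulative-distribution fit described in the text. Function names are hypothetical.

```python
from statistics import fmean, pstdev


def duty_cycle(on_times, off_times):
    """Fraction of time in the 'on' state, from mean dwell times."""
    tau_on = fmean(on_times)
    tau_off = fmean(off_times)
    return tau_on / (tau_on + tau_off)


def flag_false_localizations(photon_counts, n_sigma=2.0):
    """Mark localizations whose photon count exceeds mean + n_sigma * s.d."""
    cutoff = fmean(photon_counts) + n_sigma * pstdev(photon_counts)
    return [count > cutoff for count in photon_counts]
```

Abnormally bright localizations typically arise when two imager strands are bound within the same diffraction-limited area, which is why a simple upper photon threshold is a plausible filter.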

For each 10 nm comparison structure under each imaging condition the
corresponding technical requirement was measured using the methods described
above. In addition a projection histogram from the marked region in the image was
generated and fitted to a multi-peak Gaussian distribution. See Supplementary Fig. 3
and Supplementary Methods 3 for more details regarding these methods.

DNA origami design and self-assembly. All DNA origami nanostructures were
designed with the caDNAno software1*5, and were based on a twist-corrected variant
of the rectangular structure57 (see Supplementary Tables 1-5 for the sequence
details). The DNA origami structures used as imaging standards with specific
dimensions (20 nm square grid, 10 nm comparison standards and 5 nm triangular
grid) were designed on the basis of length measurements from AFM (Supplementary
Fig. 4). Eight staple strands were biotin-modified for surface fixation. Drift marker
structures used in the 20 nm comparison pattern experiments were folded with
DNA-PAINT extension on all possible staple strands. Staple strands used as imaging
targets were extended with DNA-PAINT docking sequences (7-10 nt in length)
with one or two thymine base(s) spacers. See Supplementary Figs 4-6 for
more details.

The DNA origami 20 nm square grid structures were self-assembled in a one-pot
annealing reaction with 50 µl total volume containing 10 nM scaffold strand
(M13mp18), 100 nM unmodified staple strands, 120 nM biotin-modified strands
and 1 µM strands with DNA-PAINT extensions in DNA origami folding buffer.
Drift markers for the 20 nm grid image were self-assembled with 400 nM of all of the
staple strands with DNA-PAINT extensions. The 10 nm comparison patterns, 5 nm
grid and 'Wyss!' pattern structures were self-assembled with 500 nM biotin-modified
staple strands and 1 µM staple strands with DNA-PAINT extensions. The
three-colour 5 nm grid structure was self-assembled with 120 nM biotin-modified
staple strands and 1 µM staple strands with DNA-PAINT extensions. For the 20 nm
square grid and 10 nm comparison pattern structures the solution was annealed
with a thermal ramp cooling from 90 °C to 25 °C over the course of 75 min. For the
5 nm grid and 'Wyss!' pattern structures, the solution was annealed with a thermal
ramp cooling from 90 °C to 20 °C over the course of 3 h and for the three-colour
20 nm grid and 5 nm grid structures the solution was annealed with a thermal ramp
cooling from 90 °C to 20 °C over the course of 72 h.

The self-assembled DNA origami structures were characterized and purified
(except for the 20 nm square grid structures) by agarose gel electrophoresis (2%
agarose, 0.5x TBE, 10 mM MgCl2, 0.5x SYBR Safe pre-stain) at 4.5 V cm⁻¹ for 1.5 h.
For purification, the gel bands were cut, crushed and filled into a Freeze 'N Squeeze
column and spun for 10 min at 800g at 4 °C.

DNA-PAINT sample preparation and imaging. DNA-PAINT sample preparation
was performed in custom-constructed flow chambers between a piece of coverslip
and a glass slide or on a commercial flow chamber slide (ibidi). Sample structures
were fixed on the surface via a biotin-streptavidin-biotin bridge by serially flowing
BSA-biotin (1.0 mg ml⁻¹), streptavidin (0.5 mg ml⁻¹) and biotin-labelled samples.
The sample concentration was calibrated for different structure and imager
combinations to make sure that similar numbers of blinking events were detected per
camera frame. The flow chamber was filled with an imaging buffer (an appropriate
concentration of dye-labelled imager strands in buffer TP) and incubated for 10 min
before imaging. For imaging with the custom-constructed flow chambers the flow
chamber was sealed with epoxy glue before imaging. See Supplementary Methods 4
for the flow chamber protocol details and Supplementary Table 6 for the
imager sequences.

Exchange-PAINT imaging for the three-colour samples was performed based on a
protocol adapted from our previous work10. In brief, the DNA origami sample was
prepared in a flow chamber and a buffer exchange was performed by serially flowing
in 400 µl of buffer B and then 200 µl of the next imaging buffer into the
imaging chamber.

DNA-PAINT super-resolution imaging. DNA-PAINT super-resolution movies
for the 10 nm standard patterns were captured with 5 Hz camera frame rate (200 ms
per frame) for all images. The laser intensity was varied from 0.3 to 1.0 kW cm⁻²
before and after meeting requirement (1). The imaging length was varied from 2,500
to 12,500 frames before and after meeting requirement (2), and to 40,000 frames
after meeting requirement (3). The imager concentration was varied from 20 nM to
5 nM before and after meeting requirement (3). DNA-PAINT movies for the 'best
condition' 10 nm standard patterns in Fig. 2 were captured with 4 Hz camera frame
rate (250 ms per frame) for (b) and (c), 2.5 Hz frame rate (400 ms) for (d); 30,000
total imaging frames for (b) and (c), 50,000 frames for (d); 15 nM imager
concentration for (b) and (c) and 5 nM for (d). The laser intensity was 1.0 kW cm⁻²
for all images. DNA-PAINT movies for the 20 nm grid images were captured
with 3.3 Hz camera frame rate (300 ms per frame), with 1.0 kW cm⁻² laser intensity,
and 3 nM of dye-labelled imager strand for 30,000 frames. DNA-PAINT movies for
the 5 nm grid images were captured with 2.5 Hz camera frame rate (400 ms per
frame), with 1.0 kW cm⁻² laser intensity and 1 nM of dye-labelled imager strand for
40,000 frames. The DNA-PAINT movie for the 'Wyss!' pattern was captured with
2 Hz frame rate (500 ms) and 0.4 nM imager strand concentration for 100,000
frames. The DNA-PAINT movie for the three-colour 5 nm grid was captured with
2.5 Hz frame rate (400 ms) and 2-3 nM imager strand concentrations with 20,000
frames per colour.

Extra drift markers were supplemented for some images. For the 20 nm grid
structure, extra drift markers with DNA-PAINT docking extensions on all strands
were used. For the 10 nm two-target structure, 5 nm grid and 'Wyss!' pattern images,
extra drift markers of the 20 nm grid structures were used. For the three-colour 5 nm
grid images, further dual-purpose drift and alignment markers of the three-colour
20 nm grid structures were used.

Super-resolution data processing and image analysis. DNA-PAINT super-resolution
movies were processed with custom-written MATLAB software. In
general, images were processed and analysed in three steps: (i) spot detection and
localization, (ii) drift correction, (iii) super-resolution rendering and quality
analysis. Spot detection and localization was performed with efficient and accurate
Gaussian fitting algorithms as reported in ref. 44. For the 10 nm comparison pattern
images, drift correction was performed with simple trace averaging only. For the
20 nm grid, 5 nm grid, 'Wyss!' pattern and three-colour 5 nm grid images, drift
correction was performed with simple trace averaging followed by templated and
geometry-templated drift correction methods with the 20 nm grid markers, as
described in the sections below. For the three-colour 5 nm grid images, alignment
across different channels was performed following drift correction procedures with
the same three-colour 20 nm grid dual-purpose markers described in sections below.
The imaging quality was characterized separately for each of the three DMI
requirements and for the final super-rendered image by a variety of methods, as
described below. Final super-resolution images were rendered with Gaussian
blurring with the standard deviation set to the estimated localization precision.

A simplified version of the software processing suite can be obtained at
http://molecular.systems/software or http://www.dmi-imaging.net. See
Supplementary Methods 5 and 6 for more details.

DNA nanostructure-based drift correction. Drift compensation based on the DNA
origami marker structures was performed using the following steps. First we selected
(either automatically or manually) a pool of isolated structures as drift markers
(either the 20 nm grid markers, all-modified drift markers or the samples
themselves) and took a simple trace average of their blinking time traces as the drift
correction trace. This was the only drift correction method used for the 10 nm
comparison pattern images. For the 20 nm grid, 5 nm grid, 'Wyss!' pattern and the
three-colour 5 nm grid samples, templated drift correction and geometry-templated
drift correction methods were then performed with the aid of 20 nm origami grid
markers in a frame-by-frame manner. For each frame, the algorithm identified all of
the localizations and assigned an 'offset vector' to each, to be used for averaging. A
photon-weighted global average of all of the calculated offset vectors was used for
drift correction of the current frame. In the templated drift correction method the
offset vector was determined from the localization position in relation to the target
site position, which was determined by a local 2D Gaussian fitting of all of the
localizations originating from that target. In the geometry-templated drift correction
method the offset vector was determined from the localization position relative to
the regular 20 nm grid-fitted target position instead of the Gaussian-fitted target
position. The remaining drift in the corrected images was estimated by comparing
the maximal allowed imaging resolution (measured by the DAFL localization
precision) and the measured imaging resolution (either by Gaussian fitting of the
single-particle averaged image or by the TLS method, see below) using the quadratic
sum principle. It is important to note that in producing the 5 nm grid, the 'Wyss!'
pattern and the three-colour 5 nm grid images, no previous information about the
sample structures was used for the templated and geometry-templated drift
correction procedures. See Supplementary Methods 5.2 for more details.
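
The per-frame averaging step and the quadratic sum estimate can be sketched as follows. This is an illustrative Python outline only: the data layout (tuples of x, y, photon count and target index) and the function names are assumptions, and the target positions are taken as given, whether they come from local Gaussian fitting (templated) or from a fitted regular grid (geometry-templated).

```python
import math


def frame_drift(localizations, targets):
    """Photon-weighted mean offset of localizations from their targets.

    localizations: list of (x, y, photons, target_index) for one frame.
    targets: list of (x, y) reference positions.
    Returns the (dx, dy) drift estimate to subtract from this frame.
    """
    weight = sum(p for _, _, p, _ in localizations)
    dx = sum(p * (x - targets[t][0]) for x, _, p, t in localizations) / weight
    dy = sum(p * (y - targets[t][1]) for _, y, p, t in localizations) / weight
    return dx, dy


def residual_drift(measured_resolution, dafl_resolution):
    """Remaining drift, assuming measured^2 = dafl^2 + drift^2."""
    diff = measured_resolution ** 2 - dafl_resolution ** 2
    return math.sqrt(max(diff, 0.0))
```

Weighting by photon count gives brighter, and therefore more precisely localized, events more influence on the drift estimate for each frame.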

Imaging quality characterization for super-rendered images. The imaging
resolution for super-rendered images was characterized using a few methods. Target
localization spread (TLS) was calculated by overlaying the localization cloud for all of
the separable targets on top of each other, aligned by the centre of mass position, and
measuring the standard deviation of the overlaid cloud of localizations; the FWHM
value was reported as the resolution estimate. Fourier ring correlation (FRC) was
calculated by splitting the image into two independent half images by cutting the
super-resolution movie stack into sections of ~100 frames and arbitrarily assigning
half of the sections to each half image. The spatial frequency correlation between
their respective 2D fast Fourier transform (FFT) spectra was calculated and the
crossover point with the noise-based 2σ threshold curve was reported as the
supported imaging resolution. See Supplementary Methods 6.1 for more details.
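
The TLS calculation can be sketched as below, assuming each target's localizations are already grouped. The function name is hypothetical, and the spread is taken as the per-axis standard deviation pooled over x and y, which is one reasonable reading of 'standard deviation of the overlaid cloud'. The FWHM follows from the Gaussian relation FWHM = 2 sqrt(2 ln 2) sigma.

```python
import math
from statistics import fmean


def target_localization_spread(clouds):
    """Return (sigma, fwhm) for clouds overlaid at their centres of mass.

    clouds: list of localization clouds, each a list of (x, y) positions.
    """
    squared_sum = 0.0
    count = 0
    for cloud in clouds:
        cx = fmean(x for x, _ in cloud)
        cy = fmean(y for _, y in cloud)
        for x, y in cloud:
            # Offsets from each cloud's own centre align all clouds.
            squared_sum += (x - cx) ** 2 + (y - cy) ** 2
            count += 1
    sigma = math.sqrt(squared_sum / (2 * count))
    fwhm = 2.0 * math.sqrt(2.0 * math.log(2.0)) * sigma
    return sigma, fwhm
```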

Single-particle class averaging analysis. The single-particle averaging analysis was
carried out with the EMAN2 software package (version 2.0). Images of the
individual particles were automatically selected and super-rendered with a pixel size
set to less than the localization precision in a custom MATLAB program and
processed with a reference-free class averaging functionality in EMAN2 (e2refine2d),
allowing only rotational and translational transformations during alignment.

A number of particles were used for the averaging (N = 700 for the 20 nm square
lattice, N = 25 for the 5 nm grid standard and N = 85 for the 'Wyss!' pattern) and from
each session the most representative class average image was selected as the final
result. See Supplementary Methods 6.2 for more details.

Automatic fitting, regular grid fitting and cross-channel alignment. Automatic
fitting was performed on the 20 nm grid (single-particle class average), the 5 nm grid
(non-averaged single-molecule), the 'Wyss!' pattern (non-averaged single-molecule)
and the three-colour 5 nm grid (non-averaged single-molecule) images. Spot
detection was performed after a Gaussian filter was used to suppress the background
variation. Automatic 2D Gaussian fitting was performed for each detected centre
with a fixed standard deviation determined by the overall image resolution. The
fitted image was rendered with the fitted positions and intensity values.

Regular grid fitting analysis was carried out for the 20 nm grid (single-particle
class average) and the 5 nm grid (both non-averaged super-resolution images and
single-particle class average) with an automatic algorithm based on the individually
fitted centres (above). The grid geometry (a square lattice for 20 nm grid and a
triangular lattice for 5 nm grid) and number of grid points were input manually and
an initial estimate of the grid boundaries was also manually set to reduce the fitting
time. The best fitted grid was determined by minimizing the r.m.s. deviation of the
fitted points.

Cross-channel alignment with three-colour 20 nm grid alignment markers was
performed by applying automatic fitting and regular grid fitting to all three
substructures of the alignment marker and recording the offset between their fitted
positions. The cross-channel registration offset was then calculated by comparing
the recorded offset with the predesigned offset across different channels. A number
of (>10) high-quality alignment markers were identified and their offset computed
in this way; the average from all of them was used as the final registration offset. See
Supplementary Methods 6.2 for more details.
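
The registration step for one pair of channels reduces to a simple average, sketched here in Python. The function name and the tuple layout are assumptions; each recorded offset is the fitted displacement between two substructures of one alignment marker, and the designed offset is the displacement built into the marker design.

```python
def registration_offset(recorded_offsets, designed_offset):
    """Average (recorded - designed) displacement over all markers.

    recorded_offsets: list of (dx, dy), one per alignment marker.
    designed_offset: (dx, dy) expected from the marker design.
    """
    n = len(recorded_offsets)
    mean_dx = sum(dx for dx, _ in recorded_offsets) / n
    mean_dy = sum(dy for _, dy in recorded_offsets) / n
    return mean_dx - designed_offset[0], mean_dy - designed_offset[1]
```

The returned vector is what would be subtracted from one channel to bring it into register with the other.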
Dynamic DNA nanotechnology provides a promising avenue for implementing
sophisticated assembly processes, mechanical behaviours, sensing and computation
at the nanoscale. However, design of these systems is complex and
error-prone, because the need to control the kinetic pathway of a system greatly
increases the number of design constraints and possible failure modes for the
system. Previous tools have automated some parts of the design workflow,
but an integrated solution is lacking. Here, we present software implementing a
three-tiered design process: a high-level visual programming language is used to
describe systems, a molecular compiler builds a DNA implementation and
nucleotide sequences are generated and optimized. Additionally, our software
includes tools for analysing and 'debugging' the designs in silico, and for
importing/exporting designs to other commonly used software systems.
The software we present is built on many existing pieces of software, but is
integrated into a single package, accessible using a Web-based interface at
http://molecular-systems.net/workbench. We hope that the deep integration
between tools and the flexibility of this design process will lead to better experimental
results, fewer experimental design iterations and the development of
more complex DNA nanosystems.


1.  Introduction 

DNA has been demonstrated to be a robust and versatile substrate for engineering
static nanostructures [1,2] and dynamic nanodevices [3,4]. The specificity of
Watson-Crick base pairing [5,6], combined with recent improvements in thermodynamic
predictive models [7,8] and rapidly decreasing costs for commercial
oligonucleotide synthesis [9], have resulted in an explosion of research in DNA
nanotechnology [10], in which hybridization interactions (rather than enzymatic
processes) are primarily used to implement the desired molecular behaviour.
Recently, the field has progressed beyond primarily demonstrations of static equilibrium
structure formation to the design of dynamic systems with kinetically
controlled, non-equilibrium dynamics [4], including molecular machines [11,12],
motors [13,14], walkers [15-17], amplifiers [16,18,19], self-assembly processes
[16,20], logic circuits [21,22] and other sophisticated computational devices [23,24].

Design of dynamic nucleic acid systems, both at the high level of abstraction
and at the low level of sequence design, requires consideration of a different set
of design parameters and metrics from the design of static DNA nanostructures.
Static structures are designed to adopt a single, minimum free energy (MFE) structure;
dynamic systems additionally require design of a kinetic pathway, that is, a series of
reactions. Disruption of any of these intended reactions or prevalence of unintended
'side' reactions can disturb the intended behaviour. For instance, poor kinetics of
individual reactions can lead to slow performance for a molecular calculation; unanticipated
'leak' reactions can cause unexpected product formation; and side reactions
that produce undesired products can result in low yields of the intended product.
These problems are amplified in more complex systems, since the
number of possible unintended interactions grows at least quadratically
with the number of species. Recent demonstrations of
nucleic acid logic circuits now exceed 100 different molecular
species [23,24], motivating the need for novel software to
assist the design of complex systems.
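
The quadratic lower bound can be made concrete with a short count of pairwise combinations. The Python function below is illustrative only (its name is an assumption); it counts unordered pairs of species, optionally including each species paired with itself, and ignores higher-order complexes, which would only increase the total.

```python
def pairwise_interactions(n_species, include_self=True):
    """Number of unordered species pairs that could in principle interact."""
    pairs = n_species * (n_species - 1) // 2
    return pairs + n_species if include_self else pairs
```

For a circuit of 100 species this gives 5,050 candidate pairings to check, compared with 55 for a 10-species system.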

To describe the design process for a dynamic DNA system,
we propose a system of three different 'tiers' of abstraction
(figure 1). The first tier describes desired system behaviour,
using a set of high-level abstractions that represent molecular
species and their interactions (figure 1a). The second tier gives
a set of idealized DNA strands and interactions (in terms of
prescribed regions of Watson-Crick complementarities) that
implement the first-tier behavioural species (figure 1b). The
lowest tier gives specific sequences of nucleotides to implement
the prescribed complementarities among the second-tier species
(figure 1c). In principle, one may begin the design process at any
of the tiers and software should automatically translate to lower
tiers; similarly, computer-aided verification at any tier should be
possible. In practice, the first two tiers are traditionally designed
by hand, and the final 'sequence design' step uses one of a
variety of computational sequence optimization [26-30] and
verification [25,31-33] packages; that is, only sequence-tier
design and analysis is automated.
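
The relationship between the second and third tiers can be illustrated with a small Python sketch. It is not DyNAMiC Workbench code: the segment names, the convention of marking a complementary segment with a trailing asterisk, and the example sequences are all assumptions chosen for illustration.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence):
    """Watson-Crick reverse complement of a DNA sequence (5' to 3')."""
    return sequence.translate(COMPLEMENT)[::-1]


def strand_sequence(segments, segment_sequences):
    """Translate a segment-tier strand into a sequence-tier strand.

    segments: ordered segment names, e.g. ["a", "b*"]; a trailing '*'
    denotes the complement of the named segment.
    segment_sequences: mapping from segment name to nucleotide sequence.
    """
    parts = []
    for name in segments:
        if name.endswith("*"):
            parts.append(reverse_complement(segment_sequences[name[:-1]]))
        else:
            parts.append(segment_sequences[name])
    return "".join(parts)
```

Because complementarity is specified once at the segment tier, changing the sequence assigned to a segment automatically updates every strand that contains that segment or its complement.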

Figure 1. Tiers of abstraction and the overview of the design workflow. The design process occurs at three 'tiers' of decreasing abstraction: at the level of (1) behavioural description, (2) segment-level complementarity design and (3) nucleotide sequences. The design workflow involves two forms of tools: panels (a-c) describe tools for creating dynamic nucleic acid systems, while panels (d-f) describe tools for analysing and testing these systems in silico. Box (d) is future work, not yet incorporated by DyNAMiC Workbench; asterisks denote tools that use previously described software. Images in all panels except (d,f) were generated automatically by DyNAMiC Workbench. (a) Users begin by describing an abstract behaviour, using a formalism such as the Nodal language of Yin et al. [16] (future extensions may incorporate alternative behavioural designers). A built-in compiler automatically generates a segment-level representation (when possible) of a system of nucleic acid strands which implements the desired behaviour, through a systematic labelling of segments corresponding to the nodes and connections specified (see §3.1). (b) A segment-tier design (specifying the identity and complementarity relationships between all segments in the system) can be either generated from the behavioural designer or assembled directly by the user (see §3.2). (c) From a segment-level design, nucleotide sequences can be generated with one of a variety of sequence design packages. DyNAMiC Workbench allows users to interact with and modify systems at all three levels (see §3.3). (d,e) Reaction enumerators identify all possible reaction paths, highlighting possible undesired interactions. Enumeration, in principle, can be performed at the behavioural tier among behavioural species, or at the segment tier among molecular species. The results of both enumerations should be comparable, but the segment tier enumeration may reveal unintended side-reactions or kinetic traps not prescribed by the behavioural enumeration (see §3.4.1). (f) Base-tier sequences can be analysed using thermodynamic methods to identify unintended secondary structure in monomers [23], as well as some unintended interactions between species (see §3.4.2).

Recent software packages have made impressive advances
in automating and integrating various parts of this design
process. Visual DSD [34,35] has been used to automate the
'segment-level' (tier 2) design and analysis of systems containing
hundreds of distinct species [25,34]. However, Visual DSD
currently does not support the design and enumeration of
branched junction structures (used in many demonstrations
of self-assembly [16] and molecular computation [21]), nor
does it support enumeration of certain reaction types, such
as four-way branch migration or branch migration with
remote toeholds [36,37]. Qian & Winfree [22,23] have presented
a compiler for their 'seesaw gate' systems, which has
been used in demonstrations of sophisticated molecular computation;
however, this abstraction is built around a single
structural motif (the seesaw gate) and is therefore of limited
use for applications in self-assembly that require a wider variety
of structural features (e.g. hairpins, branched junctions,
etc.). The NUPACK software package [8,25,30] integrates thermodynamic
design and evaluation of nucleotide sequences;
however, it does not currently provide a programming
language for tier 1 (behavioural) design, nor does it allow for
analysis or evaluation at the segment tier. Therefore, no current
package integrates a full-featured DNA programming
language with sequence designers, as well as analysis and
verification tools at the segment and sequence tiers. See
table 1 for a detailed comparison.

Table 1. Comparison of currently available software packages for dynamic DNA nanotechnology. ✓, feature is implemented by this package; ×, feature is not implemented by this package; and ✓*, feature is provided by an external package with which this package directly interfaces.

Here, we present DyNAMiC Workbench (the Dynamic
Nucleic Acid Mechanism Compiler), which provides a tightly
integrated graphical user interface for all three tiers of design,
as well as automated enumeration and analysis of potential
interactions at the segment and sequence tiers. Users may
begin by designing an abstract behaviour for a dynamic
system and are assisted in translating that behaviour first to
segments, then to nucleotide sequences. In this way, the
design process is hierarchical, progressing through several
layers of abstraction towards the ultimate implementation.
However, the user may easily enter or exit the software at
any stage of the design process, perform analysis in silico
and make changes to the design based on the results of
those tests. In this way, the design process can be iterative.

Furthermore,  DyNAMiC  Workbench  supports  integration 
with  a  number  of  commonly  used  software  packages  for  each 
tier, allowing the user to 'mix-and-match' different aspects of
preferred software. Finally, the system also includes a pluggable
framework for expansion and inclusion of new tools and interfaces;
for instance, a kinetic simulation package could be easily
added. The software is deployed as a free Web service, available
at http://molecular-systems.net/workbench, providing
a cross-platform graphical interface without requiring installation
of new software; a downloadable version is also available
(at the same URL) to support local installations if preferred.
We  believe  that  the  inherent  flexibility  in  this  design  process, 
combined  with  the  deep  integration  between  the  various 
tools,  will  help  eventually  enable  fully  automated  design  of 
dynamic systems. Expanded and improved in silico design
and  analysis  will  allow  better  experimental  results  with  fewer 
experimental  design  iterations. 


2.  Designing  a  system  with  DyNAMiC  Workbench 

To  illustrate  the  various  features  of  DyNAMiC  Workbench 
and  the  tiers  of  a  typical  design  process  for  a  dynamic  nucleic 
acid  system,  we  first  present  a  sketch  of  the  process  for 
designing  a  triggered,  catalytic  three-arm  branched  junction, 
which has been experimentally demonstrated in [16]. We also
provide four additional design examples in the electronic
supplementary material.

We  adapt  the  'Nodal'  formalism  [16]  to  describe  the  design 
procedure,  following  the  three-tiered  process  outlined  above. 
The  terminology  and  abstractions  of  the  Nodal  formalism  are 
summarized in figure 2. This formalism maps abstract behavioural
units ('node types', figure 3a) to concrete molecular
implementations ('molecule types', figure 3b), that is, DNA strands
or complexes of strands. Once a node type has been defined,
many 'Nodal species' (instances of the node type) may be
created. The process that we wish to demonstrate, triggered
catalytic three-arm junction formation, is shown in figure 3c;
we will implement this process by instantiating node types
and composing Nodal species (figure 3d). The Nodal language
allows a complicated behavioural process to be described by
composing small, reusable modules, much like functions in a
programming language (tier 1). DyNAMiC Workbench can
then use the underlying node type definitions to automatically
generate a complete segment-level (tier 2) molecular
implementation (figure 3e). Ultimately, sequences are designed
for each of these molecular species (tier 3), and many DNA
molecules are produced for each species by commercial
oligonucleotide synthesis (figure 3f).

The connectivity of our target structure (the three-arm
junction), along with our prescription that the assembly process
be stepwise, triggered and catalytic (see below), suggests a
particular architecture of nodes. DyNAMiC Workbench currently
provides a pool of approximately 20 pre-defined
node types, based on previous work [16,20,40] (electronic
supplementary material, figure S4); many of these node
types have tested DNA implementations. For this example,
we choose hairpins for the monomers and a linear strand
without secondary structure for the initiator (node types
m0 and m1). 'Ports' on the nodes represent 'domains' of
the underlying molecular implementation (see figure 2 for
terminology); the pattern of desired interactions between
domains of the molecular species is encoded in the connections
between ports on the nodes. Each domain in turn is
subdivided into several 'segments', contiguous regions of
bases that are designed to act as discrete units. Figure 3g
shows the intended execution of the Nodal reaction graph
presented in figure 3d.

Figure 2. Abstractions and their definitions. (a) A strand or set of strands with a defined behaviour may be represented as a 'node', as described by Yin et al. [16].
In a node, domains are drawn as coloured 'ports'. A triangle represents an input port, which can trigger opening of a circular 'output port'. Output ports may in
turn bind to downstream input ports. (b) Nucleic acid strands may be drawn as lines; behaviourally relevant portions of the molecule are called 'domains' and
represented by capital letters; we have highlighted an input domain A in orange and an output domain B in blue, corresponding to the ports on the node pictured
in panel (a). (c) 'Segments' represent contiguous regions of several nucleotides that act as discrete units of complementarity and are labelled by numbers or
lowercase letters. For instance, segment 4 (highlighted in yellow in panel (d)) has the sequence XMTCCAA'; each domain can comprise multiple segments.
(d) Bubbles represent individual nucleotides; hashes indicate base pairing, and an arrow indicates the 3' end of a strand.

The target structure in this example, a three-arm junction
(figure 3c), is designed to be formed in a kinetically guided
pathway out of three monomer species and an initiator
species (figure 3e). In the absence of the initiator, the monomers
are to be metastable; this is indicated by drawing
their output ports in a closed state (filled circles). Since
the monomers are metastable, the initiator is required for
target structure formation ('triggered'). The Nodal language
allows for two types of reactions: 'assembly' reactions occur
between an open input and an open output port, while
'disassembly' reactions occur between an open output and a
closed input. In the intended reaction pathway (figure 3g),
the initiator n0 reacts with node n1 via an assembly reaction;
this binding causes the output port on n1 (circle) to be
switched from closed to open, reconfiguring n1 and allowing
it to react with node n2 via another assembly reaction. The
sequential opening of ports allows the assembly process for a
single species of the target to proceed via several steps in a
prescribed order ('stepwise'). Once reacted with n2, a reconfigured
n3 can displace the initiator n0 through a disassembly reaction;
this reaction frees the initiator to react with another copy of
node n1, implementing multiple turnovers ('catalytic').

This Nodal reaction pathway can be compiled to the more
detailed reaction mechanism shown at the segment level in
figure 3h (compare each step to the Nodal mechanism in
figure 3g). The initiator n0 first binds to hairpin n1 via the
interaction between the 'toehold' segments 4 and 4* (where x* is the
Watson-Crick complement of segment x). After n1 and n0 are
thus co-localized, the 3, 2 and 1 segments on n0 can displace
their analogues, hybridizing to the 3*, 2* and 1* segments,
via a process known as 'toehold-mediated branch migration'
[41-44]. At the conclusion of branch migration, monomer n1
is reconfigured so that the toehold segment 1* is no longer
sequestered in the duplex and thus becomes available for a
similar downstream reaction with hairpin n2. This process
continues similarly for the remaining species.


The Nodal design can be easily constructed by the user via
a drag-and-drop graphical interface. After completing the
system design using the tier 1 Nodal formalism, the user invokes
the DyNAMiC Workbench compiler, which automatically
generates a valid set of tier 2 segment labels in order to implement
the behaviour. The software can also identify potential
interactions between molecules due to complementary segment
interactions ('reaction enumeration'), including spurious
reactions that may necessitate a redesign (§3.4.1). The reaction
enumerator in DyNAMiC Workbench was used to generate
the mechanism shown in figure 3h. The segment-tier (tier 2)
descriptions capture all relevant structures and interactions of
the system, assuming that all non-complementary segments
have effectively orthogonal sequences (so as to not significantly
interact with one another).

In reality, some degree of spurious interaction between
segments is unavoidable, since real sequences are not perfectly
orthogonal; careful sequence design is needed in order for the
system to implement the behaviour as intended, both ensuring
that desired interactions are favourable and minimizing
undesired side reactions that may occur between segments,
whether within or across different molecules [27]. For this step,
DyNAMiC Workbench converts the segments generated in
the previous tier into poly-N sequences of the proper lengths.
The poly-N sequences are then mutated into a set of
non-interacting sequences (a tier 3 design) using one of several
existing software tools: Domain Design (DD) [29], NUPACK
[25,30] or Multisubjective [45] (§3.3).
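
As a rough illustration of the poly-N step, the sketch below builds poly-N placeholders from segment lengths and threads segment labels into strand sequences. The segment lengths, the strand composition and the helper names (`reverse_complement`, `thread`) are assumptions made for illustration only; they are not taken from DyNAMiC Workbench.

```python
# Hypothetical helpers; not part of DyNAMiC Workbench.
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the Watson-Crick complement of seq, read 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def thread(strand, segments):
    """Concatenate segment sequences; a trailing '*' marks a complement."""
    parts = []
    for label in strand:
        if label.endswith("*"):
            parts.append(reverse_complement(segments[label[:-1]]))
        else:
            parts.append(segments[label])
    return "".join(parts)


# Assumed segment lengths (nt) for a four-segment strand.
lengths = {"1": 6, "2": 6, "3": 6, "4": 6}
poly_n = {name: "N" * n for name, n in lengths.items()}
print(thread(["1", "2", "3", "4"], poly_n))  # 24 N's, a template for a designer

# Once real sequences are assigned, complementary strands follow automatically.
designed = {"3": "AAC", "4": "GGT"}
print(thread(["3", "4"], designed))    # AACGGT
print(thread(["4*", "3*"], designed))  # ACCGTT
```

Storing only the unstarred segments and deriving starred ones on demand keeps every complementary pair consistent by construction, which is one plausible reason to design at the segment level first.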

Finally, DyNAMiC Workbench interfaces with tools
[25,32,36] to perform thermodynamic calculations as heuristic
tests to assess the quality of the final sequences. For instance,
we may compute the MFE structures of each of the strands or
complexes in our ensemble to verify that they adopt the
intended secondary structures in their monomeric form.
Additionally, DyNAMiC Workbench can be instructed to
look for strong pairwise interactions between species which
are intended to be non-interacting. Once the user is satisfied
that the sequences will implement the intended complementarity
scheme, DyNAMiC Workbench can export the
sequences of all relevant DNA strands; these sequences can
then be directly provided to a commercial supplier of
oligonucleotides for synthesis.

rsif.royalsocietypublishing.org J. R. Soc. Interface 12: 20150580

Figure 3. Design example. The Nodal formalism [16] maps (a) simple behavioural units (nodes) to (b) molecular architectures. Commonly used molecular motifs, or
'molecule types', may be expressed as node types by describing the primary and secondary structure of the molecule type and then assigning functional roles
(e.g. input or output) to the molecule's domains. (c) Our target process is the conversion of a set of metastable hairpins into a three-arm junction, with a
single-stranded initiator serving as a trigger. (d) Behavioural description of our catalytic three-arm branched junction, described by composing behavioural
nodes. Once a node type has been defined, multiple Nodal species with the given node type may be instantiated and connected together. Each species of a
given type will have the same basic structure, but will correspond to a distinct molecular species with a unique sequence identity. Connections between
Nodal species (arrows) represent desired behavioural interactions between their domains. (e) These behavioural interactions also thus imply Watson-Crick
complementarity relationships between the domains. DyNAMiC Workbench can use the node type definition to automatically map a set of Nodal species to a set of
molecular species whose sequence complementarity relationships implement the intended behavioural interactions. This panel shows the segment-level
representation of the hairpin monomer and initiator species which will make up our system. (f) Sequences may be designed to implement the molecular species, and DNA
molecules may be produced by commercial oligonucleotide synthesis. (g) Intended 'execution' of the Nodal complementarity graph. (h) Segment-level enumeration
of possible reactions between four starting complexes. Short, single-stranded regions at the hairpin termini serve as nucleation sites ('toeholds') to prime branch
migration reactions. The opening of the hairpin by branch migration exposes new toeholds, implementing the cascade.


3.  Methods  and  implementation 

We have developed and unified graphical tools for designing
and analysing systems at the three tiers of abstraction. Figure 1
describes the various tools we have implemented and/or
integrated, and several tools that could be implemented in the
future, as well as the intended workflow for a user developing
a system with DyNAMiC Workbench. This section describes
the input to, computation performed by, and output from
each of these tools in detail, as well as discussing important
features of the tools and their interfaces. Figure 4 shows screenshots
of various tools.


3.1.  Tier  1:  behavioural/Nodal  design 

As described above, the Nodal formalism maps abstract
behavioural units (node types) to known molecular
implementations. The input to the Nodal compiler is a set of instances
(Nodal species) of these node types, along with a prescribed
connectivity between the ports of Nodal species. Input may
be constructed by a 'drag-and-drop' graphical interface
(figure 4a and electronic supplementary material, figure S1),
or by describing the system using a text-based language
called DyNAML (the Dynamic Nucleic Acid Markup
Language) [46]. The hairpin monomers and the single-stranded
initiator described above (§2) are two examples of molecule
types. In the example of the previous section, there were
three distinct 'species' of the hairpin monomer molecule
type. Like the three distinct hairpin monomers, each
corresponding Nodal species will have a unique identity, but will
share the same basic shape with the other species of the same
type. In the same way, generated molecular species of the
same type will have different sequences but similar dimensions
and secondary structures.

Figure 4. DyNAMiC Workbench screenshots. (High-resolution image; view PDF for details.) (a) Nodal (tier 1) design interface. Systems are composed by dragging-and-dropping
nodes from the palette on the left (1), then connecting nodes in the centre panel (2). The right panel shows a preview of the molecular implementation
(3). See electronic supplementary material, figure S1, for details. (b) Segment-level (tier 2) design interface. Centre panel shows secondary structure view of each
complex in the system (4). Lower panel lists sequences and composition of strands in the system (5). Right panel shows name and sequence of each distinct segment in
the system (6). See electronic supplementary material, figure S2, for details. (c) Multisubjective sequence (tier 3) design interface. Similar to the tier 2 design
interface, different panels show complexes, strands and segments. The secondary structure view also highlights unintended interactions and shows bases flagged for
modification by the analysis. See electronic supplementary material, figure S3, for details and images of other sequence design interfaces. (d) Reaction enumerator
interface. Rectangular nodes represent complexes, joined by circular nodes representing reactions between intermediates. See figure 5 for details.

The Nodal compiler uses the definition of each node type
(the mapping to a known molecular implementation), as well
as the network of connections between Nodal species, to
produce a segment-level representation of the system, such that
connections between nodes are implemented as appropriate
Watson-Crick complementarity relationships between
segments. To do this, the Nodal compiler first creates an
instance of each molecular species (one for each Nodal species
in the system), then labels the segments in the molecular
species such that they satisfy the prescribed connections
between nodes. The compiler begins with an initiator node,
then traverses the network of connected nodes in a
breadth-first search until all segments are labelled. The compiler will
return an error if two complementarity statements are in
conflict and cannot be fulfilled (e.g. if one connection requires a
segment be labelled 7 and another connection requires that it
be labelled 7*), or if a complementarity statement exists
between domains of incompatible shape (e.g. if a domain A,
comprising an 8-nt segment, is to be complementary to a
domain B which contains an 8-nt segment and a 2-nt segment).
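
The labelling pass described above can be illustrated with a deliberately simplified sketch. It treats each domain as a bare list of segment lengths and each connection as a complementarity requirement between two domains; the real compiler also tracks strands and secondary structure, which are omitted here. The function and variable names (`label_segments`, `flip`, the example domains) are hypothetical.

```python
from collections import deque


def flip(label):
    """Toggle the complement marker: '7' <-> '7*'."""
    return label[:-1] if label.endswith("*") else label + "*"


def label_segments(domains, connections, start):
    """Breadth-first labelling of segments.

    domains: dict mapping a domain name to its segment lengths (nt), 5' to 3'
    connections: pairs of domain names that must be complementary
    start: domain at which the traversal begins (e.g. on the initiator)
    """
    neighbours = {name: [] for name in domains}
    for a, b in connections:
        neighbours[a].append(b)
        neighbours[b].append(a)

    labels = {start: [str(i + 1) for i in range(len(domains[start]))]}
    queue = deque([start])
    while queue:
        current = queue.popleft()
        for other in neighbours[current]:
            # Complementary domains pair antiparallel, so lengths must mirror.
            if domains[other] != domains[current][::-1]:
                raise ValueError(f"incompatible shapes: {current} vs {other}")
            wanted = [flip(label) for label in reversed(labels[current])]
            if other not in labels:
                labels[other] = wanted
                queue.append(other)
            elif labels[other] != wanted:
                raise ValueError(f"conflicting labels for {other}")
    return labels


domains = {"A": [8, 2], "B": [2, 8], "C": [8, 2]}
labels = label_segments(domains, [("A", "B"), ("B", "C")], start="A")
print(labels["B"])  # ['2*', '1*']
print(labels["C"])  # ['1', '2']
```

In this simplification, domains unreachable from `start` are simply left unlabelled; a fuller implementation would need to start a new traversal for each disconnected component.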

The output from the Nodal compiler is a segment-level
(tier 2) representation of the system, encoded in the
DyNAML Intermediate Language (DIL) [46]. This representation
can be further edited by the user or converted to a
sequence-level (tier 3) representation.

The Nodal language and compiler are highly general:
rather than enforcing specific invariants on the form or structure
of the underlying molecular primitives, or supporting only
specific, pre-defined molecular motifs, our compiler allows
the inclusion of systems with arbitrary nucleic acid secondary
structures and behaviours. This flexibility is essential to
supporting the previously discussed hairpin motifs, as well
as more complex multi-stranded motifs (such as branched
junctions), both of which are important for structure formation
applications. The language also allows arbitrary new motifs
to be developed. DyNAMiC Workbench contains a collection
of built-in node types with DNA implementations based
on published literature (electronic supplementary material,
figure S4), but also supports definition of new node types.
A new node type can be defined by the following procedure:
the segment-wise primary and secondary structure of the underlying
molecule type are described, the molecule's segments are
grouped into domains and the domains are assigned functional
roles such as 'input' and 'output'. New node types can also be
defined from portions of existing systems. The ability to create
'composite node types' allows designed systems to be extended
and re-used, by easily abstracting portions of an existing system
into reusable components.

The software environment is built to be intuitive and easily
usable, but also to provide detailed information to advanced
users if necessary. A graphical interface (electronic
supplementary material, figure S1) reduces the barrier to entry for
those unfamiliar with computer programming. Users construct
Nodal species by dragging and dropping from a palette of
pre-defined and custom node types. New node types with new
molecular implementations can be defined by using a graphical
interface (electronic supplementary material, figure S1b) or by
selecting components of a Nodal program and exposing
relevant domains on the nodes (electronic supplementary
material, figure S1). For example, to define a reusable
three-arm junction node type, the user could first define a three-arm
junction system by composing nodes, then 'wrap' that system
to define a new motif, exposing one input port. New node
types can also be defined using DyNAML. The interface
provides real-time feedback, generating interactive previews of
the compiled molecular species and highlighting errors (as
described above).

The Nodal language, compiler and design interface are
described in detail in [46].

3.2.  Tier  2:  segment  design 

A  tier  2  design  comprises  a  description  of  each  of  the  strands  in 
a  system,  a  division  of  the  strands  into  segments,  a  statement  of 
which  segments  should  be  complementary  (or  equal)  to  one 
another  and  an  intended  initial  secondary  structure  for  each 
of  the  strands.  Essentially,  a  tier  2  design  captures  all  of  the 
desired inter-molecular and intra-molecular interactions
between species in the system without assigning specific
nucleotide sequences to segments. If the user has described
their initial behavioural design using the Nodal formalism,
DyNAMiC Workbench can automatically generate a tier 2
design as discussed above. The user may also choose to
begin the design process at the segment tier and use DyNAMiC
Workbench primarily for sequence design (§3.3); in this case,
the user can input a tier 2 design directly using a graphical
interface (figure 4b), or using a number of standard
text-based  input  formats. 
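
To make the ingredients of a tier 2 design concrete, the following sketch checks that an intended secondary structure is consistent with the segment labels of a strand. It assumes a segment-level dot-parenthesis notation (one symbol per segment), which is an illustrative convention chosen here and not necessarily the format used by DyNAMiC Workbench; the names `complement` and `check_structure` are likewise hypothetical.

```python
def complement(label):
    """Toggle the complement marker: '3' <-> '3*'."""
    return label[:-1] if label.endswith("*") else label + "*"


def check_structure(segments, structure):
    """Return (i, j) index pairs whose paired segments are not complementary.

    segments: segment labels of one strand, 5' to 3'
    structure: one symbol per segment; '(' and ')' are paired, '.' is unpaired
    """
    if len(segments) != len(structure):
        raise ValueError("one structure symbol is needed per segment")
    stack, mismatches = [], []
    for i, symbol in enumerate(structure):
        if symbol == "(":
            stack.append(i)
        elif symbol == ")":
            if not stack:
                raise ValueError("unbalanced structure")
            j = stack.pop()
            if segments[i] != complement(segments[j]):
                mismatches.append((j, i))
        elif symbol != ".":
            raise ValueError(f"unexpected symbol {symbol!r}")
    if stack:
        raise ValueError("unbalanced structure")
    return mismatches


# A hypothetical hairpin: toehold 4, stem 3-2-1, loop 5, stem 1*-2*-3*.
hairpin = ["4", "3", "2", "1", "5", "1*", "2*", "3*"]
print(check_structure(hairpin, ".(((.)))"))        # []
print(check_structure(["1", "2", "3"], "(.)"))     # [(0, 2)]
```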

DyNAMiC Workbench uses the DIL as a common intermediate
format for the tier 2 design; a graphical interface
allows the user to modify the design at this tier and to transfer
the design to various other tools (electronic supplementary
material, figure S2). The user may change the number
of nucleotides in existing segments, or impose specific
constraints on the segments to be generated in tier 3. The user
may inspect and modify the primary and secondary structures
of the complexes in the system. The user may send the tier 2
design to in silico analysis tools, such as a segment-level reaction
enumerator (§3.4.1). Similarly, the tier 2 design may be
transferred to a sequence designer (tier 3) to generate specific
nucleotide sequences that implement the design; this process
is described next.


3.3.  Tier  3:  sequence  design 


The  input  to  each  of  the  'sequence  designers'  is  a  segment-level 
(tier  2)  representation  of  the  system,  along  with  any  constraints 
or  restrictions  specified  by  the  user  (for  instance,  the  user  may 
want  to  hold  the  sequence  of  some  particular  segment  fixed  in 
order  to  incorporate  a  restriction  enzyme  cutting  site). 

Sequence  design  is  the  process  of  generating  sequences  of 
nucleotides  to  implement  a  particular  set  of  complementarity 
and  orthogonality  relationships  between  nucleic  acid  strands. 
Several  sequence  design  methodologies  exist  [27].  One 
approach  uses  calculations  of  the  partition  function  [47-50] 
for  an  ensemble  based  on  a  detailed  thermodynamic  model 
of  nucleic  acid  secondary  structure  [51].  This  method  attempts 
to  maximize  the  probability  that  the  ensemble  adopts 
the  intended  secondary  structure(s),  while  minimizing  the 
chance that unintended interactions occur [27]. Recent work
has extended this paradigm to entire test tubes of complexes
[52]. It should be noted that these thermodynamic designers
do not explicitly consider kinetic behaviour and that sequences
with desirable thermodynamics may have undesirable kinetic
properties [29]. One objective of future work is to explicitly
incorporate  kinetic  models  in  sequence  design. 

Earlier methods focused on minimizing sub-sequence
repeats or maintaining a minimum edit distance between
non-complementary sequences in an ensemble, an approach
known as 'sequence symmetry minimization' [26,53]. Evolutions
of this approach [28,29] combine sequence symmetry
minimization with some insights from thermodynamic
models, as well as various heuristic methods, to produce
sequences that meet additional criteria desirable for dynamic
systems. These recent heuristic approaches may have
favourable computational complexity compared to the
thermodynamic approaches (scaling quadratically with the number of
segments or strands, rather than cubically with the number
of bases), but the actual computation time depends on the
algorithm and the implementation. Finally, recent software
combines these two approaches by passing limited subsets of
a system between various designers, while independently
analysing sequences according to a variety of criteria (e.g.
attempting to explicitly eliminate secondary structure in key
'toehold' regions) [45].
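
The core idea behind minimizing sub-sequence repeats can be sketched in a few lines: count every window of length k across a set of strands and report those that occur more than once. This is a simplified illustration rather than the algorithm of any cited package (for instance, it ignores reverse complements and intended complementarity); the function name `repeated_kmers` and the example sequences are made up.

```python
from collections import Counter


def repeated_kmers(strands, k):
    """Return every length-k sub-sequence that occurs more than once."""
    counts = Counter()
    for seq in strands:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return {kmer: n for kmer, n in counts.items() if n > 1}


# Two made-up strands sharing the windows ACG and TAC.
print(repeated_kmers(["ACGTAC", "TTACGG"], k=3))  # {'ACG': 2, 'TAC': 2}
```

A designer based on this idea would mutate bases until the returned dictionary is empty, or as small as the constraints allow.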

Rather than selecting a single design methodology, our software
incorporates and provides interfaces to various software
tools that implement these different approaches (figure 4c and
electronic supplementary material, figure S3). First, we have
extended the core algorithms of Zhang's DD package, which
performs heuristic, 'segment-based sequence design' [29], to
add flexible stopping conditions and increase sequence diversity.
Segment-based sequence design uses an approach similar
to sequence symmetry minimization to design individual
segments, which are then 'threaded' together to form full
strands. In DD, a scoring function/hill-climbing algorithm is
used to combine this technique with heuristic metrics of features
believed to be important for dynamic systems. We have also
developed a graphical interface to DD (electronic supplementary
material, figure S3a). This interface allows the user to
graphically add, remove and edit segments, as well as to
thread these segments into strands. The individual segments
and the full design can be visualized in real time as DD
automatically tunes segments.
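
A generic scoring function/hill-climbing loop of the kind mentioned above might look like the following sketch. It is not DD's actual scoring function or search procedure: the score here is simply the number of repeated 3-nt windows, the stopping condition is a fixed step budget or a score of zero, and all names (`score`, `hill_climb`, the example segments) are assumptions for illustration. Segments are assumed to be non-empty.

```python
import random


def score(segments, k=3):
    """Penalty: number of length-k windows already seen elsewhere."""
    seen, penalty = set(), 0
    for seq in segments.values():
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            if kmer in seen:
                penalty += 1
            seen.add(kmer)
    return penalty


def hill_climb(segments, fixed=(), steps=2000, seed=0):
    """Mutate one base at a time, keeping changes that do not worsen the score."""
    rng = random.Random(seed)
    best = dict(segments)
    best_score = score(best)
    mutable = [name for name in best if name not in fixed]
    for _ in range(steps):
        if best_score == 0 or not mutable:
            break  # stopping condition: nothing left to improve or to mutate
        name = rng.choice(mutable)
        pos = rng.randrange(len(best[name]))
        trial = dict(best)
        trial[name] = best[name][:pos] + rng.choice("ACGT") + best[name][pos + 1:]
        trial_score = score(trial)
        if trial_score <= best_score:
            best, best_score = trial, trial_score
    return best, best_score


# Made-up starting point; segment 't' is held fixed by the user.
start = {"1": "AAAAAA", "2": "AAAAAA", "t": "GATTACA"}
designed, final = hill_climb(start, fixed=("t",))
print(score(start), "->", final)
```

Accepting neutral moves (equal score) lets the search drift across plateaus; holding segments fixed mirrors the user-specified constraints described at the start of this section.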

Additionally,  we  provide  a  direct  interface  to  the  NUPACK 
multi-objective  thermodynamic  sequence  design  [25]  Web 
service (electronic supplementary material, figure S3b). Several
tools  in  our  package  generate  input  scripts  for  the  NUPACK 
designer,  which  can  then  be  readily  submitted  to  the 
NUPACK  Web  server. 

Finally, we have developed an interface to the 'Multisubjective'
sequence design package [45] (electronic supplementary
material, figure S3c). Multisubjective uses several sequence
design packages (the NUPACK multi-objective designer and
DD) to eliminate unintended secondary structure from key
regions of strands in dynamic systems. This is an iterative
process: sequence design is initially performed with another
sequence designer (DD or NUPACK), then sequences are
evaluated by Multisubjective. Multisubjective identifies
unintended secondary structure, long regions of repeated
nucleotides, etc. and proposes mutations to the designed
sequences. The sequences, with mutations, are then
resubmitted to the primary sequence designer and the process
is repeated. Our interface allows the user to inspect the
analysis performed by Multisubjective and edit the suggested
modifications before redesigning.

We note briefly that, though our focus is on DNA, the design
considerations are essentially identical for RNA (though
thermodynamic designers will use different sets of parameters
[51,54,55]). All three packages discussed also support the
design of RNA sequences. Design and evaluation of mixed
DNA/RNA sequences is outside the scope of current work.

The  output  of  each  of  these  sequence  designers  is  a  set  of 
nucleotide  sequences  for  each  of  the  complete  strands  in  the 
system.  These  sequences  can  be  exported  to  a  text  file  or  a 
spreadsheet,  which  can  be  used  to  order  sequences  from 
a  commercial  oligonucleotide  synthesis  provider. 


The  reaction  enumerator  is  described  in  greater  detail  in 
ref.  [56],  as  well  as  m  an  upcoming  publication. 


3.4.2.  Sequence-level  analysis 

Once  system-level  behaviour  has  been  designed  and  analysed, 
and  nucleotide  sequences  have  been  generated,  sequences 
can  be  analysed  to  determine  experimental  suitability.  A  var¬ 
iety  of  analyses  can  be  performed.  For  small  self-assembly 
systems,  the  MFE  structure  and  base  pair  probabilities  of  the 
entire  ensemble  can  be  determined  using  a  full  physical 
model  wThich  computes  the  partition  function  for  the  system 
[49,50].  For  larger  dynamic  systems  (beyond  tens  of  species 
or  hundreds  of  bases),  this  exhaustive  computation  becomes 
impractical.  However,  the  MFE  structure  and  base  pair  prob¬ 
abilities  of  individual  strands  or  small  subsets  of  strands  may 
still  be  examined  to  discover  unintended  secondary  structure 
within  strands  or  spurious  interactions  between  strands. 

Many  packages  exist  which  can  perform  these  compu¬ 
tations.  Our  software  Integrates  directly  with  several  publicly 
available  Web  services  for  sequence-level  analysis.  The 
NUPACK  Web  server  can  perform  full  partition  function  and 
pair  probability  calculation  [25].  The  Mfold  and  DINAmelt 
Web  servers  provide  several  types  of  analysis  [31,38].  The 
TBI  Vienna  RNAfold  Web  server  allows  computation  of 
the  MFE  structure  and  partition  function  [32].  These  software 
tools  provide  a  wide  range  of  options  and  allow  the  use  of  var¬ 
ious  available  thermodynamic  parameter  sets  for  both  DNA 
and  RNA.  DyNAMiC  Workbench  allows  users  to  easily 
submit  any  sequence  displayed  within  the  software  to  these 
Web  servers  for  analysis  through  a  simple,  unified  interface. 


3.4.  Analysis  tools 

3.4.1,  Enumerator 

Reaction  enumeration  is  the  process  of  predicting  the  net¬ 
work  of  possible  reactions  between  a  given  set  of  starting 
complexes.  For  dynamic  nucleic  acid  systems,  a  reaction 
enumeration  at  the  level  of  segments  (rather  than  individual 
nucleotides)  can  provide  valuable  insights  into  the  behaviour 
of  the  system  (by  culling  much  of  the  detail  of  a  sequence- 
level  enumeration  and  focusing  on  larger  interactions). 
Results  of  a  segment-level  enumeration  could  also  be  used 
to  perform  stochastic  or  analytical  mass-action  kinetic  simu¬ 
lations  of  the  reaction  networks  in  order  to  predict  physical 
behaviours  of  systems  in  vitro .  Finally,  segment-level  reac¬ 
tion  enumeration  could  be  used  for  formal  verification  of 
system  behaviour* 

We have built an interface for performing segment-level
reaction enumeration [56] and visualizing the results
(figures 4d and 5). Our interface displays a graph of starting,
intermediate and resting/end-state complexes, connected by
the reactions between these complexes. The interface automatically
colours fast versus slow reactions, transient/short-lived
versus resting/long-lived complexes and initial complexes
versus enumerated complexes. The graph can be readily
traversed by zooming and panning, and the interface highlights
relevant substrates or reactions proximal to a complex of interest
(for instance, all reactions producing or consuming a given
complex can be highlighted). The graph can be re-arranged by
dragging and dropping to aid visualization. Each complex or
reaction can also be isolated and visualized as an interactive
secondary structure diagram.
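
The highlighting and colouring behaviour described above can be pictured as simple queries over a reaction graph. The sketch below is an illustration under stated assumptions, not the interface's implementation: the `Reaction` class, the reaction and complex names, and the rule that a complex is "transient" whenever some fast reaction consumes it are all hypothetical simplifications.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Tuple


@dataclass(frozen=True)
class Reaction:
    name: str                     # hypothetical label for the reaction
    reactants: Tuple[str, ...]    # names of consumed complexes
    products: Tuple[str, ...]     # names of produced complexes
    fast: bool                    # True for fast reactions, False for slow ones


def reactions_touching(reactions: Iterable[Reaction],
                       complex_name: str) -> Dict[str, List[str]]:
    """Return the reactions that consume or produce the given complex."""
    reactions = list(reactions)
    return {
        "consuming": [r.name for r in reactions if complex_name in r.reactants],
        "producing": [r.name for r in reactions if complex_name in r.products],
    }


def classify_complexes(reactions: Iterable[Reaction],
                       complexes: Iterable[str]) -> Dict[str, str]:
    """Label a complex 'transient' if any fast reaction consumes it (assumed rule)."""
    transient = {c for r in reactions if r.fast for c in r.reactants}
    return {c: ("transient" if c in transient else "resting") for c in complexes}


# Toy network (assumed): slow binding followed by a fast displacement step.
network = [
    Reaction("bind", ("A", "B"), ("I",), fast=False),
    Reaction("displace", ("I",), ("P", "W"), fast=True),
]
highlight = reactions_touching(network, "I")
kinds = classify_complexes(network, ["A", "B", "I", "P", "W"])
```

Here `highlight` lists the reactions a viewer would emphasize when the intermediate `I` is selected, and `kinds` gives the category used to colour each complex.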


3.5.  Utilities 

3.5.1.  Structure  visualization 

DyNAMiC Workbench includes a flexible system for visualizing
arbitrary unpseudoknotted nucleic acid secondary structures,
which was used to generate all images of secondary structure
that appear in this paper. The basic visualization combines
a traditional tree-based 'planar graph' [57,58] layout with a
linear representation (allowing branched structures to be more
easily visualized) and an interactive colouring scheme. This
visualization is used throughout the software to display secondary
structures, and can also be accessed as a separate utility.

3.5.2. Sequence manipulation

Finally, a suite of utilities is provided in DyNAMiC Workbench
to perform various common transformations on nucleic acid
sequence strings. These range from Watson-Crick
complementation to Levenshtein distance calculation [59] to threading
segments into strands. The user can also quickly perform
sequence-level thermodynamic analysis (through the interfaces
described above). Finally, sequences can be exported to/imported
from various standard formats (FASTA, CSV, etc.).
These utilities are available in context anywhere the user may
interact with sequences.
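
As an illustration of the kinds of transformations listed above, the following sketch implements Watson-Crick complementation, Levenshtein distance, threading of segments into a strand, and FASTA export. It is not the code used in DyNAMiC Workbench; the function names, the DNA-only alphabet, and the convention that a trailing `*` on a segment name denotes its complement are assumptions made for the example.

```python
from typing import Dict, List

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}  # DNA only (assumed)


def reverse_complement(seq: str) -> str:
    """Watson-Crick complement, read 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def levenshtein(a: str, b: str) -> int:
    """Edit distance (insertions, deletions, substitutions), row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                 # deletion
                            curr[j - 1] + 1,             # insertion
                            prev[j - 1] + (ca != cb)))   # substitution
        prev = curr
    return prev[-1]


def thread_segments(strand: List[str], segments: Dict[str, str]) -> str:
    """Concatenate segment sequences; 'x*' means the complement of segment x."""
    parts = []
    for name in strand:
        if name.endswith("*"):
            parts.append(reverse_complement(segments[name[:-1]]))
        else:
            parts.append(segments[name])
    return "".join(parts)


def to_fasta(records: Dict[str, str]) -> str:
    """Serialize named sequences as minimal FASTA text."""
    return "".join(f">{name}\n{seq}\n" for name, seq in records.items())


strand_seq = thread_segments(["a", "b*"], {"a": "ATG", "b": "CCA"})
fasta_text = to_fasta({"s1": strand_seq})
```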

4.  Architecture 

The software is deployed as a Web service: software tools are
installed on a server and managed by a supervisor layer, which
is tightly connected to a Web server (figure 6). The Web server
exposes a rich client-side interface. The supervisor manages
user files, converting between different formats and launching
computational tools. Examples of server-side computational
tools include the Nodal compiler and the segment-level
enumerator. The core Web server and supervisor layers are
written in JavaScript and are executed by NodeJS, an evented,
asynchronous runtime and Web server based on Google's V8
JavaScript engine.

Figure 5. Graphical interface to a reaction enumerator [56]. (High-resolution image; view PDF for details.) (a) The enumerator calculates a network of possible reactions (1)
between the starting complexes, as well as the possible intermediate complexes formed. The user can pan and zoom throughout the network to view intermediate complexes.
Double-clicking a complex (2) or reaction (3) allows its structure to be inspected. The reaction network shown here is the execution of the three-arm junction system, a detailed
version of the schematics figure 3/i. (b) The user can hover over components of the graph to highlight connections to neighbouring components; in this case, arrows showing the
reactants (4, 5) of a bimolecular association reaction (red dot, 6) are highlighted in blue and purple, while the arrow showing the product (7) is highlighted in red. Additionally, the
graph layout can be re-arranged by dragging-and-dropping complexes, reactions and arrows. These interactive features allow larger reaction graphs to be explored and interpreted.

Figure 6. Software architecture. (a) The architecture supports a rich set of high-level, client-side applications. These applications are presented graphically to the
user. (b) A Web server exposes this interface and handles requests from the client-side applications, dispatching those requests to the computational tools and the
file system. (c) A wide variety of file formats are supported, and many conversions between related formats can be performed automatically. These files are then
consumed by computational tools, which may run on the same machine as the Web server/supervisor, or be located remotely (e.g. run on a cluster). (d) The Web
interface interacts directly with tools that are provided independently as Web services. For instance, the Web interface is capable of submitting requests directly to
various sequence analysis servers [25,31,32,38].

A  benefit  of  the  client-server  architecture  is  that  server- 
side  tools  can  be  written  in  any  language,  compiled  or 
deployed  for  a  single  architecture  (that  of  the  server),  and 
made  available  to  clients  running  on  many  platforms  through 
the  Web  interface.  Various  server  tools  are  currently  written 
in  JavaScript,  Python,  C  and  C++. 

The  client-side  interface  comprises  a  file  manager  and 
many  applications  that  provide  graphical  interfaces  to  the 
underlying  server  tools.  Examples  of  client-side  applications 
include  the  Nodal  designer,  DD  and  the  enumerator  interface. 
The client-side interface is written in JavaScript and uses a
variety of open-source user interface toolkits. User registration
allows  designs  and  other  user  files  to  be  saved  on  the  server 
across  multiple  sessions. 

5.  Discussion 

We have presented software that facilitates a design process that
is hierarchical and iterative. A user designs an abstract behaviour
which can be compiled into segments and then sequences. The
ability to easily enter the design process, to perform analysis,
and to make edits to the design at any tier of abstraction
(behavioural, segment or sequence) makes DyNAMiC Workbench an
effective platform for iterative design. The goal of this flexible
design process is to allow for fewer experimental design
iterations, by moving in silico much of the testing and 'debugging'
that would have been done by costly and time-consuming
laboratory experiments. We have shown an example system
designed using the software, and we present four additional
designs in the electronic supplementary material: a larger,
six-arm junction (electronic supplementary material, §S2.1); a
bounded dendrimeric structure (electronic supplementary
material, §S2.2); an exponentially-amplifying catalytic circuit
(electronic supplementary material, §S2.3); and a
self-assembling three-dimensional tetrahedron (electronic supplementary
material, §S2.4). With these designs, we discuss in greater
detail the practical challenges of implementing large systems
using the software.

We have chosen to incorporate several existing tools for each
stage of the design process. For instance, when designing
sequences, users can easily choose one of the several tools
discussed above, or can combine the results of several tools. The
availability of, and interoperability between, multiple tools, even
for the same task, provides two benefits. First, different tools
may be better suited to perform a given task under different
circumstances; for instance, DD may be more useful for design
of large kinetic systems, while a thermodynamic designer such
as NUPACK may provide a more rigorous design for smaller
systems. Second, easy, rapid switching between different tools
will allow for better comparison of tools' performance and
improved characterization of tools' strengths and weaknesses
for different tasks.

Our software is deployed as a publicly available Web
service. This design decision was motivated by several
observations. First, installation of software (especially software
which requires the user to build an executable from source
code, or to use the command line) can be a significant barrier
to adoption. Second, the variety of file formats in use within
the field can require tedious manual juggling when working in
traditional command-line or desktop interfaces. Third, many
software tools in the field require significant computational
resources; providing these tools as a Web service allows users
to use or at least experiment with these tools without making
a significant upfront investment in computational resources.
The Web-based interface allows the user to access a graphical
interface to the service, using any device with a modern Web
browser. This makes the service truly cross-platform and also
obviates the need for the user to install any software or to
maintain an updated version of the software, since updates to both the
interface and the tools themselves can be centrally managed.
For users who manage their own research computing facilities,
the software is also available as a downloadable package.

The software has been designed with extensibility in mind;
while we have worked to integrate many existing tools, this is
only a starting point. Many excellent packages exist which have
not yet been integrated. To this end, the architecture allows
new server-side tools to be added and new client-side
applications to be easily developed. We intend this software and
the associated service to serve as a 'clearinghouse' for dynamic
DNA/RNA nanotechnology software development and
deployment. We have taken several steps towards this goal:
first, the entire source code for the project is to be released
under the GNU General Public License. We hope this will
encourage users to modify and contribute to the software, in
addition to encouraging usage of the public Web server.
Second, we have developed and provided extensive
documentation of the application programming interface (API) for the
software, to facilitate development of third party extensions;
this API documentation is linked to from within DyNAMiC
Workbench's built-in user manual, and can be accessed from
the main 'Help' page within the software.

As summarized in table 1, there are many possible features
that are yet to be implemented. Reaction enumeration could be
performed at the behavioural tier, so that the intended reaction
pathway can be compared directly to the expected reaction
pathway at the segment tier. The behavioural- and segment-tier
reaction enumerations could also be used to perform kinetic
simulations and determine the time-evolution of concentrations
of intermediate complexes. Electronic supplementary material,
figure S5, demonstrates some of these possible extensions.

In future work, we intend to improve capabilities for in silico
analysis, as well as provide higher level design tools for
self-assembling systems. Specifically, we intend to incorporate
various simulation tools based on the segment-level enumerator
(for instance, ODE-based or stochastic kinetic
simulators), as well as separate, base-level kinetic simulation
tools [39]. It would be interesting to develop tools for higher
level design of self-assembly processes; for instance, a two- or
three-dimensional 'molecular canvas', which would eventually
allow users to quickly and easily translate two-dimensional
images and three-dimensional structures to molecular
implementations. Expanding individual tools to better handle more
diverse structural motifs (for instance, pseudoknots, which are
currently prohibited by the compiler and enumerator) and
reaction types is another area for future work. Finally, in the
Nodal compiler included with DyNAMiC Workbench, there
is a one-to-one mapping between pre-defined node types
and corresponding molecular implementations. However, this
need not be the case for every behavioural designer. For instance,
a future behavioural designer could operate on abstract chemical
reaction networks, adopting one of several translation schemes
[60,61] to convert the behaviour into a molecular structure. We
anticipate many such future designers are possible.

Finally, an ongoing goal will also be to seek out and integrate
valuable existing tools; we hope this will be a collaborative effort
embraced by the entire community.

SUMMARY

Synthetic gene networks have wide-ranging uses in
reprogramming and rewiring organisms. To date,
there has not been a way to harness the vast potential
of these networks beyond the constraints of a
laboratory or in vivo environment. Here, we present an
in vitro paper-based platform that provides an
alternate, versatile venue for synthetic biologists to
operate and a much-needed medium for the safe
deployment of engineered gene circuits beyond the lab.
Commercially available cell-free systems are freeze
dried onto paper, enabling the inexpensive, sterile,
and abiotic distribution of synthetic-biology-based
technologies for the clinic, global health, industry,
research, and education. For field use, we create
circuits with colorimetric outputs for detection by eye
and fabricate a low-cost, electronic optical interface.
We demonstrate this technology with small-molecule
and RNA actuation of genetic switches, rapid
prototyping of complex gene circuits, and programmable
in vitro diagnostics, including glucose sensors and
strain-specific Ebola virus sensors.

INTRODUCTION 

The field of synthetic biology aims to re-engineer the molecular
components of the cell to harness the power of biology. In doing
so, synthetic biologists have created whole-cell biosensors
(Kobayashi et al., 2004; Kotula et al., 2014), synthetic probiotics,
new sources of drugs (Fossati et al., 2014), green energy (Torella
et al., 2013), and chemistry (Zhang et al., 2012; Martin et al.,
2003). At the heart of many of these cellular technologies are
synthetic gene networks (Elowitz and Leibler, 2000; Gardner et al.,
2000), which are decision-based circuits often composed of a
sensor element followed by a transducer that regulates a
measurable output (Figure 1A). With sequence-specific sensing of
nucleic acids and small-molecule recognition, these engineered
logic and output elements hold great potential for broad
biotechnology and medical applications. However, the application of
cell-based synthetic gene networks outside of the laboratory
has been restricted by concerns of biosafety and the practicality
of the cellular host.


Earlier studies in the area of in vitro synthetic biology and
cell-free systems have made important contributions to our
understanding of fundamental molecular biology and biochemistry
and, more recently, in the study of molecular switch dynamics
and complex gene circuits (Hong et al., 2014; Karzbrun et al.,
2014; Sun et al., 2014; Takahashi et al., 2014). These efforts,
however, have focused on solution-phase reactions using fresh
from frozen cell-free systems and often in liposomes with the
goal of assembling artificial cells (Kuruma et al., 2009; Kobori
et al., 2013). These solution-phase reactions are not stable or
practical for handling outside of the lab and therefore miss the
opportunity to leverage the abiotic and sterile nature of these
systems.

Here, we report a method for embedding cell-free synthetic
gene networks onto paper and other materials for in vitro
applications in the clinic, global health, industry, research, and
education. This is achieved by freeze drying cell-free systems into
paper and other porous substrates to create materials with the
fundamental transcription and translation properties of a cell
but that are sterile and abiotic (Figure 1A). Stable at room
temperature, these embedded materials are readily stored,
distributed, and can be activated by simply adding water. We begin
by demonstrating the successful operation of synthetic networks
in this alternative medium for biomolecular reactions and then
move on to show rapid screening of gene constructs and
application of the paper-based platform for programmable in vitro
diagnostics, including strain-specific Ebola sensors.

RESULTS 

Development of Paper-Based Technology

To begin, we tested to see whether the enzyme activity required
for transcription and translation could be reconstituted from
freeze-dried cell-free expression systems, which normally require
storage at −80°C. The commercially available PT7 expression
system, which is assembled from ribosomes and 35 purified
bacterial proteins (Shimizu et al., 2001; Shimizu and Ueda, 2010), was
freeze dried to form a pellet, reconstituted in water, and then used
to transcribe and translate a constitutive GFP expression
cassette. Remarkably, the freeze-dried and reconstituted PT7
solution-phase reactions showed activity comparable to the
fresh from frozen PT7 reactions (Figure 1B). This approach was
then extended to systems with greater molecular complexity
based on whole-cell extracts. Using expression constructs
dependent on either E. coli RNA polymerase (RNAP; S30) or T7
RNAP (S30 T7), constitutive expression of GFP was supported in
small, solution-phase reactions started from freeze-dried pellets
(Figures S1A and S1B available online). Importantly, these
freeze-dried pellets are stable over time, with PT7 transcription and
translation activity remaining after a year of room temperature
storage (Figure 1C).

Although freeze-dried preparations offer great potential for the
storage of poised synthetic biology tools, the handling of
solution-phase reactions is awkward for applications outside of the
lab. To address this, we embedded the cell-free systems and
synthetic gene networks onto paper and other porous materials
(Figure 1A). Paper has previously been used in measuring pH and,
more recently, in chemistry-based diagnostics (Martinez et al., 2007);
we speculated that it could also serve as a high-capillary-action matrix
to hold small-volume molecular and biochemical reactions. To
begin, we tested whether simple cell-free expression could be
supported while embedded in the cellulose matrix of paper.

Small 2 mm filter paper discs were freeze dried with bacterial
cell-free systems containing either E. coli (S30) or T7 RNAPs
(S30 T7, PT7) and a corresponding GFP expression plasmid
(Figure 1D). For all experiments, unless otherwise specified,
the gene construct or network was freeze dried as a DNA copy
into paper discs, along with the appropriate cell-free system
(Figure S1G). After being freeze dried and rehydrated, these
paper-based reactions yielded consistent GFP fluorescence under
the regulation of either RNAP. Fluorescence imaging of the paper
discs confirmed GFP expression, and reactions were monitored by
placing the paper discs at the bottom of black, clear-bottom
384-well plates for incubation in a 37°C plate reader (Figure 1E).

Synthetic  Gene  Networks 

Next, we tested inducible expression systems to determine
whether more complex synthetic gene networks could be
supported by the freeze-dried, paper-based reactions (Figure 1A).
We started with tetO-regulated expression. Regulation of this
system is mediated by the Tet repressor (TetR) that binds to
the tetO promoter, preventing transcription (Figure 1F; Lutz
and Bujard, 1997). Such regulation performed in vitro required
that constitutive TetR expression also be encoded into the
synthetic gene network. Expression is then induced by addition of
the tetracycline analog anhydrotetracycline (aTc), which disrupts
TetR binding to the promoter, allowing transcription of the
regulated gene.

Freeze-dried discs were prepared with a cell-free system
containing S30 E. coli cell extract, pretranslated TetR, and the
network elements encoding the constitutive expression of
TetR and tetO-regulated GFP or mCherry. Varying only the
presence of aTc, discs were rehydrated and incubated at 37°C. The
aTc-induced discs yielded expression of GFP and mCherry at
5- and 32-fold, respectively (Figures 1G, S1C, and S1D). To
optimize performance, we found that regulatory control from the
tetO promoter could be tightened if cell extracts were
supplemented, prior to freeze drying, with TetR to control promoter
leakage prior to the expression of the encoded TetR (Figures
S1E and S1F). This successful TetR augmentation of cell-free
systems prior to freeze drying suggests that protein doping
may be a useful way to tailor and enhance the regulatory and
enzymatic environment of cell-free applications.

An important trend for synthetic biology is the use of
RNA-actuated gene circuits (Callura et al., 2012). Unlike circuits based
on small-molecule-based regulation, which are limited by the
number of available unique recognition domains, RNA-based
switches can be rationally programmed to recognize
sequence-specific triggers and accordingly have the potential to
offer essentially unlimited design and signaling space (Isaacs
et al., 2004; Callura et al., 2010; Lucks et al., 2011; Mutalik
et al., 2012). Here, we test a new generation of riboregulators
in an in vitro demonstration of toehold switches (Green et al.,
2014 [this issue of Cell]). These robust biomolecular switches
provide tight translational regulation over transcripts and exhibit
excellent orthogonality. Moreover, as we will demonstrate, by
controlling the expression of alternative RNA polymerases
(e.g., T3 RNAP), toehold switches can also trigger the synthesis
of additional switch RNAs, enabling cascades and more
sophisticated synthetic circuits.

The rational programmability of toehold switches comes from
their design (Figure 2A). Riboregulators are composed of two
cognate RNAs: a transducer RNA that encodes the output signal
of the system (e.g., a GFP mRNA) and a trigger RNA that
modulates the output signal. Conventional riboregulators have
historically repressed translation by sequestering the ribosomal binding
site (RBS) of the transducer RNA within a hairpin. This hairpin is
unwound upon binding of a cognate trigger RNA, exposing the
RBS and enabling translation of the downstream protein.
However, this design restricts the potential trigger RNAs to those
that contain RBS sequences. Toehold switches have removed
this constraint by moving the RBS to a loop region of the hairpin,
leaving the trigger RNA binding site free to adopt virtually any
sequence. The transducer or "switch" RNA of toehold switches
also contains a single-stranded domain known as a toehold at
its 5' end. This toehold domain, first developed in in vitro
molecular programming studies (Yurke et al., 2000), provides the initial
reaction site for binding between the trigger and switch RNAs and
greatly improves the ON/OFF ratio of the switches.

Figure 1. Paper-Based Synthetic Gene Networks

(A) The enzymes of transcription and translation are combined with engineered gene circuits and then embedded and freeze dried onto paper to create stable and
portable synthetic gene networks outside of the cell. These networks are genetically encoded tools with trigger, regulatory transducer, and output elements.

(B) GFP expression in solution phase from fresh and freeze-dried PT7 cell-free reactions.

(C) Freeze-dried pellets of the PT7 cell-free expression system are stable for over a year at room temperature, yielding GFP expression in solution phase when
rehydrated.

(D) A schematic of the constitutive GFP expression constructs used on paper.

(E) Images of paper discs following a 2 hr incubation and maximum fold-change measurements of constitutive paper-based GFP expression from freeze-dried
S30, S30 T7, and PT7 cell-free systems during the first 90 min of incubation.

(F) Schematic of tetO regulation for GFP and mCherry.

(G) Images of paper discs following a 2 hr incubation and maximum fold-change measurements of GFP and mCherry from the tetO-regulated promoter during the
first 2 hr of incubation ± aTc inducer (11 nM) from freeze-dried S30 reactions. BF indicates bright-field images, and F indicates fluorescence images.

Error bars represent SD.

Figure 2. Freeze-Dried RNA-Actuated Gene Circuits on Paper

(A) Schematic of the RNA-based toehold switch regulating translation of GFP.

(B) Images of paper-based GFP expression from eight toehold switches (A-H) in the PT7 cell-free system, ± complementary trigger RNAs following a 2 hr
incubation.

(C) Maximum fold-change measurement of GFP expression from paper-based toehold switches A-H during the first 90 min of incubation.

(D) RNA-actuated expression of GFP, Venus, mCherry, and Cerulean fluorescent proteins from toehold switch H on paper and quartz microfiber discs following a
2 hr incubation.

(E) Bright-field and fluorescence images of an orthogonality screen between toehold switches and trigger RNAs using paper-based reactions arrayed in a 384-well
plate. Images were collected after the overnight data collection.

(F) Quantification of fluorescence over time from paper discs containing switch G (bottom row of the array). All data were generated from freeze-dried, cell-free
reactions embedded into paper with their respective gene circuits. Trigger RNA concentration used for toehold switch activation was 5 μM. BF indicates
bright-field images, and F indicates fluorescence images.

Error bars represent SD.

We began by demonstrating the regulation of GFP expression
under the control of eight different toehold switches on paper.
Experiments were performed by freeze drying the recombinant
PT7 expression system onto paper discs, along with linear
DNA encoding specific switch RNAs. The paper discs were
rehydrated with or without the complementary RNA trigger 24 hr after
drying and then incubated at 37°C in a humid chamber for 2 hr. As
expected, discs that received the trigger RNA exhibited strong
GFP expression, whereas control discs exhibited little, if any,
expression (Figure 2B). Toehold switch and trigger RNAs can
also be directly applied to the paper at the appropriate times
or synthesized from DNA on the paper. By freeze drying the
paper with pretranscribed toehold switch RNA, we were able
to detect positive reactions 6 to 8 min earlier than having the
RNAs transcribed in place (data not shown). However, due to
the greater stability of DNA, we chose to freeze dry all paper
reactions with DNA copies of the respective toehold switches.

Parallel experiments were done to measure reaction kinetics.
We found that, within 90 min of 37°C incubation, the maximum
ON/OFF ratios ranged between 10- and 140-fold (Figure 2C),
with signal detection in as little as 20 min (Figure S2A). Maximum
expression rates occurred between 60 and 120 min, and the
reaction output generally reached a plateau by 200 min. Taking
advantage of the nuclease-free nature of the PT7 recombinant
system, we titrated RNA directly into the reactions and found a
linear response to RNA concentration between low nanomolar
and low micromolar concentrations (Figure S2B). We continued
to use purified trigger RNAs so that we could control the
concentration and chose 5 μM trigger RNA to maximize response in
demonstration reactions. It is also of note that switch behavior
in freeze-dried lysates replicates that of toehold switches in vivo
in E. coli (Figure S2C; Green et al., 2014). Importantly, the clear
target-dependent induction of the toehold switches in the
RNA-rich environment of E. coli helps to demonstrate sequence
specificity of these RNA components.
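
For readers who want to see how a maximum ON/OFF ratio might be extracted from plate-reader time courses, the following is a minimal sketch. The exact normalization and background handling used for the reported values are not described in this section, so the point-by-point ratio of triggered to untriggered fluorescence, the function name and the example readings are all assumptions.

```python
from typing import Sequence


def max_fold_change(on_signal: Sequence[float],
                    off_signal: Sequence[float]) -> float:
    """Largest ratio of triggered (ON) to untriggered (OFF) signal over time.

    Both sequences are assumed to be sampled at the same time points.
    Time points with a non-positive OFF signal are skipped.
    """
    if len(on_signal) != len(off_signal):
        raise ValueError("ON and OFF traces must have the same length")
    ratios = [on / off for on, off in zip(on_signal, off_signal) if off > 0]
    if not ratios:
        raise ValueError("no time point with a positive OFF signal")
    return max(ratios)


# Hypothetical fluorescence readings (arbitrary units) at three time points.
example_ratio = max_fold_change([10.0, 50.0, 200.0], [10.0, 10.0, 20.0])
```

With these made-up readings the ratios at the three time points are 1, 5 and 10, so the function returns 10.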

Whole-cell extracts and cellulose fibers exhibit an inherent
autofluorescence around the emission spectrum of GFP (Schmidt,
2010), so we also constructed gene circuits to contain output
fluorescent proteins with alternate emission spectra for both
paper and quartz microfiber discs (Figures 2D and S2D). With these
distinct outputs, a single paper disc can also be used to host
multiple toehold switches, each capable of individual activation
and measurement (Figure S2E). We also successfully tested
toehold switches in S30 T7 E. coli cell extracts, a lower cost
alternative, using circularized DNA plasmid as a template for both
toehold switch and trigger RNAs (data not shown).

By arraying the paper discs into a 384-well plate, multiplexed
reactions can be easily quantified and measured over time. We
tested the orthogonality of the toehold switches by combining
each of the eight toehold hairpins against each of the trigger
RNAs. As seen in Figure 2E, activation of the toehold switches
was essentially limited to the diagonal, with only one weak
off-target interaction between switch E and trigger H (Figures 2E,
2F, S2G, and S2H). Despite not being explicitly designed to be
orthogonal to one another, significant orthogonality was also
reflected in the specific activation demonstrated above in the
in vivo milieu of bacteria cells (Figure S2C). This arrayed
approach also underscores another important advantage of our
in vitro paper-based system, namely, the rapidity with which it
can be used to rationally design and test synthetic constructs. To
collect similar data in vivo would require 81 separate
transformations, overnight cultures on plates, and subsequent
culturing in multiwell plates and could take several days. Using
in vitro paper-based reactions, this experiment took under 90 min
to set up, and, upon incubation, signals from activated switches
were detected in as little as 20 min (Figure S2H).
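
The orthogonality screen described above amounts to checking that a switch-by-trigger matrix of responses is dominated by its diagonal. The following is a minimal sketch of that check, not the authors' analysis: the function name, the example values, and the threshold are all hypothetical and chosen only for illustration.

```python
def off_target_pairs(fold_matrix, labels, threshold):
    """Return (switch, trigger, fold) for off-diagonal cells at or above threshold.

    fold_matrix[i][j] is the fold change of switch i in the presence of
    trigger j. The threshold is an illustrative assumption; the source
    does not state a numerical cutoff.
    """
    hits = []
    for i, row in enumerate(fold_matrix):
        for j, fold in enumerate(row):
            if i != j and fold >= threshold:
                hits.append((labels[i], labels[j], fold))
    return hits


# Made-up example: three switches, one weak off-target interaction.
example_labels = ["A", "B", "C"]
example_matrix = [
    [40.0, 1.1, 0.9],
    [1.0, 25.0, 3.5],
    [1.2, 0.8, 60.0],
]
example_hits = off_target_pairs(example_matrix, example_labels, threshold=3.0)
```

With the made-up numbers above, only the pair (switch B, trigger C) is reported; every diagonal entry is ignored by construction.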

Enabling  Technologies 

In keeping with our goal of making synthetic biology-based tools
widely available outside of the laboratory, we designed our
paper-based system to generate a colorimetric output visible to
the naked eye. To do so, we replaced GFP with the enzyme
β-galactosidase (LacZ) to create a synthetic gene network that
generates a dramatic enzyme-mediated color change in
response to conditional inputs (Figure 3A). LacZ cleaves the yellow
substrate, chlorophenol red-β-D-galactopyranoside, embedded
into the freeze-dried paper discs, to produce a purple
chlorophenol red product that is visible to the naked eye and can be
measured on standard plate readers by monitoring the
absorbance at 570 nm.

As with fluorescent outputs from the GFP toehold switches,
each triggered LacZ switch produces a unique switch-dependent
response, with maximum expression rates occurring
between 50 and 120 min (Figures 3B, 3C, and S3A). Some toehold
switches rapidly reached their maximum output, whereas others
produced a relatively linear output throughout overnight
experiments. To maximize detection speed and sensitivity, we used
the enzymatic reaction rate determined from the slope of
absorbance at 570 nm over time to calculate fold change for the family
of LacZ toehold switches (Figure 3C). As with the fluorescent
reactions, paper-based reactions with colorimetric outputs can be
arrayed for large-scale, quantitative experiments (Figure S3B).
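
A rate-based fold change of this kind can be sketched as follows. This is only an illustration under stated assumptions: the text says that fold change was calculated from the slope of absorbance at 570 nm over time, but the sliding-window fit, the window length, and all names and values below are hypothetical.

```python
def slope(times, values):
    """Ordinary least-squares slope of values against times."""
    n = len(times)
    if n < 2 or n != len(values):
        raise ValueError("need at least two matching time/value points")
    mean_t = sum(times) / n
    mean_v = sum(values) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (v - mean_v) for t, v in zip(times, values))
    return sxy / sxx


def max_rate(times, values, window=3):
    """Largest slope found over any run of `window` consecutive points."""
    if len(times) < window:
        raise ValueError("not enough points for the chosen window")
    return max(
        slope(times[i:i + window], values[i:i + window])
        for i in range(len(times) - window + 1)
    )


def rate_fold_change(times, triggered, control, window=3):
    """Ratio of maximum color-change rates: triggered over untriggered."""
    off_rate = max_rate(times, control, window)
    if off_rate <= 0:
        raise ValueError("control rate must be positive to form a ratio")
    return max_rate(times, triggered, window) / off_rate


# Made-up A570 readings taken every 10 min.
minutes = [0, 10, 20, 30, 40]
a570_triggered = [0.10, 0.10, 0.30, 0.70, 0.90]
a570_control = [0.10, 0.11, 0.12, 0.13, 0.14]
example_fold = rate_fold_change(minutes, a570_triggered, a570_control)
```

Using a rate rather than an endpoint means a switch can be scored as soon as its color starts to develop, which is consistent with the stated aim of maximizing detection speed.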

Using switch D to demonstrate color development, detection
of the reaction can be seen by the naked eye to begin at
25 min and saturates in ~1 hr (Figure 3D). Quantification of these
reactions can also easily be done by measuring the intensity of
the color channels red, green, and blue from images produced
by most cameras, including those widely available in cellphones.
By tracking the generation of signal in the blue channel, relative
to yellow signal (red + green channels), the progress of the
activated freeze-dried toehold switch reaction can be assessed and
quantified (Figure 3E). Similar results were observed for the
whole family of toehold switches (Figure 3B).
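
The blue-over-yellow measure can be sketched in a few lines. This is a minimal illustration, assuming the pixels of one paper disc have already been extracted from a camera image as (red, green, blue) tuples; the function name and the pixel values are hypothetical, and no particular imaging library is implied.

```python
def blue_over_yellow(pixels):
    """Ratio of summed blue signal to summed yellow (red + green) signal.

    `pixels` is an iterable of (red, green, blue) tuples covering one disc.
    """
    pixels = list(pixels)
    if not pixels:
        raise ValueError("no pixels supplied")
    red = sum(p[0] for p in pixels)
    green = sum(p[1] for p in pixels)
    blue = sum(p[2] for p in pixels)
    yellow = red + green
    if yellow == 0:
        raise ValueError("red + green signal is zero; ratio undefined")
    return blue / yellow


# Made-up pixel values: a yellow (untriggered) disc and a purple (triggered) disc.
yellow_disc = [(200, 180, 40)] * 4
purple_disc = [(120, 40, 140)] * 4
ratio_yellow = blue_over_yellow(yellow_disc)
ratio_purple = blue_over_yellow(purple_disc)
```

Because the measure is a ratio of channels from the same image, it is less sensitive to overall brightness than a single channel would be, which may help when images come from different cameras.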

We have also incorporated an alternative colorimetric reporter
enzyme, chitinase, into synthetic gene networks for systems
based on extracts from cells that contain a LacZ background
(Figure S3G). As demonstrated with our tetO-GFP and
tetO-mCherry switches, the presence of aTc inducer results in the
expression of chitinase, which cleaves a colorless substrate
(4-nitrophenyl N,N'-diacetyl-β-D-chitobioside) to yield a yellow
p-nitrophenol product. The colorimetric output is visible to the
naked eye and can be quantified using a standard plate reader
(410 nm; Figures S3D-S3F). Color development with this system
was linear from the onset of detectable output at about 60 min,
until the end of the experiment at 800 min, with a maximum
induction of 38-fold at 420 min.

An important advantage of paper-based distribution of
synthetic gene networks is their potential for low cost and relative
ease of manufacture. As a proof-of-principle demonstration,
we created printed arrays of stable synthetic gene networks
using a standard computer printer and chromatography paper
(Figure 3F). A commercially available wax-based ink serves as
a hydrophobic barrier separating respective reaction spaces to
create custom arrays and layouts (Carrilho et al., 2009). As a
demonstration of the technique, we printed 5 × 5 arrays to
host the freeze-dried LacZ-expressing toehold switch E and
then rehydrated the array with and without trigger RNA to create
a checkerboard pattern (Figure 3G).

For other practical applications, we built a low-cost electronic
optical reader to enable quantification and ultimately automation
of the paper-based reactions. Such a device could also impart
these molecular devices with the ease of use and convenience
of a home glucose monitor. To measure light transmission
through these reactions, we placed LacZ-based toehold
switches on paper discs between LED light sources (570 nm)
and electronic sensors (Figure 3H). In the event of a positive
reaction, light transmission is progressively blocked by the
production of the purple LacZ cleavage product. LEDs and sensors were
coordinated through multiplexers connected to an Arduino that
controlled the read pattern and rate parameters (Figure S3G).
Electronic hardware was housed using computer-designed parts
laser cut from acrylic to create a device for under $100 USD
(Figures 3H and S3H). Freeze-dried paper discs were placed into
holes on a chip, rehydrated, incubated in the device at 37°C,
and monitored in real time through an attached laptop. As a proof
of concept, we tested the LacZ toehold switch G and observed
consistent and significant reads from different concentrations
of RNA trigger (Figure 3I).
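
Because the reader measures how much 570 nm light passes through each disc, its raw readings can be put on an absorbance-like scale with the standard relation A = -log10(I / I0). The sketch below assumes this conversion for illustration only; the text does not state how the device's readings were processed, and the names and sensor counts are hypothetical.

```python
import math


def absorbance(sample_counts, blank_counts):
    """Absorbance-like value from transmitted-light sensor readings.

    sample_counts: reading through the reaction disc.
    blank_counts: reading through a reference (for example, the same
    disc at time zero). Both are assumed to be positive and linear in
    light intensity, which is an assumption about the sensor.
    """
    if sample_counts <= 0 or blank_counts <= 0:
        raise ValueError("sensor readings must be positive")
    return -math.log10(sample_counts / blank_counts)


def absorbance_series(readings):
    """Convert a time series of readings, using the first as the blank."""
    if not readings:
        raise ValueError("no readings supplied")
    blank = readings[0]
    return [absorbance(r, blank) for r in readings]


# Made-up sensor counts falling as purple product accumulates.
example_counts = [1000, 800, 400, 100]
example_absorbance = absorbance_series(example_counts)
```

On this scale a positive reaction appears as a rising curve, so the output can be compared directly with plate-reader absorbance traces.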

An  In  Vitro  Diagnostics  Platform 

With a device in hand, we next turned our focus to developing
toehold switch sensors capable of detecting full-length active
mRNA targets, a key feature for a diagnostics platform. Our first
goal was the detection of GFP and mCherry mRNAs. Using an
algorithm that predicts RNA secondary structure, mRNA sensors
were built to target sequences likely to be accessible for binding
(Green et al., 2014) and, when tested, yielded fluorescent
induction in the presence of GFP (60-fold) and mCherry (13-fold)
mRNAs, respectively (Figures 4B and 4C). As a demonstration
of the potential for paper-based synthetic gene networks as an
in vitro diagnostics platform, we next developed colorimetric
mRNA sensors for antibiotic resistance genes (Figure 4A). In
freeze-dried, paper-based reactions, mRNA sensors for
spectinomycin (5-fold), chloramphenicol (7.5-fold), kanamycin
(24-fold), and ampicillin (30-fold) resistance genes yielded significant
LacZ induction in the presence of their respective mRNAs at
3,000 nM (Figures 4D and S4A). The mRNAs used in these
proof-of-concept experiments were provided in their purified form.
However, given the fidelity of the toehold switches in the
RNA-rich environment of the cell (Figure S2G; Green et al., 2014),
we are confident that in vitro paper-based toehold sensors will
exhibit similar specificity in the presence of mixed RNA samples.

Switching from the plate reader, we then tested the ampicillin
resistance mRNA sensor using the electronic optical reader and
found significant detection of target transcripts as low as 3 nM
(Figures 4E and S4B). Moreover, the signal from our in-house
device was about three times greater than the plate reader at
equivalent trigger RNA concentrations (Figures 4D, 4E, and S4A).

In addition to the purpose-built synthetic gene networks
presented above, this in vitro paper-based approach also holds
tremendous potential for hosting other gene circuits, including
those already in the literature as synthetic biology tools.
Moreover, although the work above focused on bacterial
components, the platform should in principle be readily extendible to
mammalian-based systems. As confirmation, we tested a
paper-based system using freeze-dried HeLa cell extracts and
found strong expression of GFP in the presence of a constitutive
expression plasmid (Figure 4F). To demonstrate the use of a
preexisting molecular tool, we chose to build a portable, in vitro
glucose sensor on paper using a FRET-based nanosensor
originally designed for measuring glucose in mammalian cells
(Figure 4G; Takanaga and Frommer, 2010). The DNA construct
for the nanosensor was freeze dried, along with HeLa cell
extracts onto paper discs, and then rehydrated in the presence
or absence of glucose. Upon rehydration, we observed the
glucose-related shift previously reported in mammalian cells in


Figure 3. Colorimetric Output from Paper-Based Synthetic Gene Networks

(A) A schematic of the modified, LacZ-expressing toehold switches used to generate colorimetric outputs.

(B) Images of the paper-based, colorimetric output from toehold switches ± complementary RNA triggers following a 2 hr incubation.

(C) Maximum fold-change measurements from LacZ toehold switches during the first 2 hr of incubation. Fold induction based on the rate of color change from
LacZ toehold switches.

(D) The paper-based development of color from LacZ toehold switch D over 60 min.

(E) Color intensities from (D) were converted to a ratio of blue over yellow (red + green channels) channels and graphed over time.

(F) Schematic describing the process of arraying synthetic gene networks on paper using printed arrays.

(G) A 25-reaction printed array (14 × 14 mm) of toehold switch E, containing positive reactions (purple) and negative control reactions (yellow) distributed in a
checkerboard pattern following a 2 hr incubation.

(H) Schematic of the low-cost, electronic optical reader developed to read colorimetric output from paper-based synthetic gene networks. Paper reactions held in
a chip between an LED light source (570 nm) and electronic sensor.

(I) Time-course data from the electronic optical reader of toehold switch G in the presence of 0, 30, 300, and 3,000 nM trigger RNA. Data were collected every 10 s
with SD indicated by line thickness, and all data were generated from freeze-dried, cell-free reactions embedded into paper with their respective gene circuits.
Trigger RNA concentrations used for toehold switch activation were 5 µM or as specified.

Error bars represent SD.


Cell 159, 1-15, November 6, 2014 ©2014 Elsevier Inc.

Please cite this article in press as: Pardee et al., Paper-Based Synthetic Gene Networks, Cell (2014), http://dx.doi.org/10.1016/j.cell.2014.10.004

[Figure 4 image. Recoverable panel labels: Activated complex; Fluorescent reporters; LacZ enzyme; Research tool output; Colorimetric diagnostics output; mCherry mRNA sensor; mCherry mRNA; GFP mRNA; Plasmid DNA; HeLa cell extract; Glucose sensor; Glucose.]

Figure 4. Application of Paper-Based Synthetic Gene Networks

(A) Schematic of the paper-based mRNA sensors based on toehold switches.

(B) Images and fold-change measurements of a paper-based mCherry mRNA sensor in the presence and absence of full-length target mRNA, following a 2 hr
incubation. GFP is produced in response to detection of mCherry mRNA.

(C) Images and fold-change measurements of a paper-based GFP mRNA sensor in the presence and absence of full-length target mRNA. mCherry is produced in
response to detection of GFP mRNA.

(D) Maximum fold change during the first 90 min of the LacZ-mediated color output rate from sensors for mRNAs encoding resistance to spectinomycin,
chloramphenicol, ampicillin, and kanamycin antibiotics using a plate reader.


(legend continued on next page)
our portable paper-based system within a physiologically
relevant glucose concentration range (Figures 4H and S4C).
Highlighting the stability of this FRET-based sensor, the SD of reads
remains low even when compiled from data collected over a
100 min period. Importantly, de novo translation of the
nanosensor seems to be critical to function because freeze-dried
preparations of pretranslated protein did not exhibit the characteristic
shift in fluorescence.

Rapid  Sensor  Prototyping 

The low cost of manufacturing (4¢-65¢/sensor) is a key feature of
the paper-based platform for diagnostics. However, perhaps
even more important for the adoption and potential impact of
this technology is the time and cost of developing new sensors.
To test how rapidly new sensors could be developed, we chose
to build mRNA sensors for the viral pathogen Ebola. Our goal
was to construct and test 24 sensors that could distinguish
between the Sudan and Zaire strains of the virus in under a
day. Using our algorithm, sensors were designed to target
mRNA from 12 regions (A-L) of the mRNA encoding the Ebola
nucleoprotein, which differs in length by only three nucleotides
between the Sudan and Zaire strains (Figure 5A). Construction
began with PCR amplification of synthetic 135-nt DNA oligos
bearing the toehold switch sensor cassettes, followed by ligation
of the modules to our LacZ reporter chassis. The ligated product
was then amplified by PCR. Followed by only column
purification, sensors were tested on paper discs containing the
freeze-dried PT7 cell-free system for testing in the presence and
absence of 36 nt trigger RNAs. Here, we chose to add the sensor
element during the rehydration phase of the experiment to
demonstrate the potential for rapid prototyping and to highlight
that the user does not need to know what sensors will be tested
ahead of time, as might be the case with an emerging pathogen.
Due to potential challenges with access to full-length material,
we chose to use short 36 bp trigger RNAs rather than full-length
mRNA to test the sensors. Although accessibility due to RNA
secondary structure is important to keep in mind, work from
our previous mRNA sensors suggests that short RNA triggers
provide a representative indication of sensor function.

The colorimetric dynamics of the 240 reactions were captured
using a plate reader and could be tracked by eye (Figures 5B and
5C). Each of the 24 sensors was triggered in the presence of their
target (3,000 nM), with maximum induction during the first 90 min
ranging 4- to 77-fold. Remarkably, beginning with the arrival of
the DNA oligos, this phase of the screen was completed in less
than 12 hr. We then selected four matching sets (D, E, G, and
H) of the Sudan and Zaire sensors to test specificity and found
a high degree of strain-specific discrimination (Figure 5D), as
well as sensitivity down to a concentration of 30 nM trigger
RNA for both strains (Figure 5E).

Rapid  Prototyping  and  Testing  of  Complex  Synthetic 
Gene  Networks 

As a demonstration of the capacity of the paper-based system
for the prototyping and testing of more complex synthetic gene
circuits, we assembled a converging transcription cascade
that can convert the transcription activity of the native E. coli
RNAP in a S30 cell extract into T3 and/or T7 RNAP activity
(Figure 6A). To this point we have validated the paper-based system
using more than 45 toehold switches, each with unique
combinations of inputs and measurable outputs. Here, we assembled
toehold switches to create a series of molecular reactions that
lead to the de novo production of transcriptional tools. In the
presence of DNA encoding the toehold switch and trigger for
the T3 module, T3 RNAP is transcribed and translated, which
activates T3-mediated transcription and translation of GFP from an
otherwise passive component (Figure 6B). Alternatively, in the
presence of DNA encoding the toehold switch and trigger for
the T7 module, T7 RNAP is transcribed and translated, leading
to the expression of GFP from a T7-dependent promoter
(Figure 6C). If both the T3 and T7 toehold switch modules are
combined, E. coli RNAP generates both T3 and T7 RNAPs.
This converging transcription activity leads to their respective
trigger and toehold switch RNAs, producing a third route to
GFP expression (Figure 6D). This work shows that the platform
can sustain multiple rounds of transcription and translation and
be utilized to construct, test, and debug complex circuits in a
modular fashion.

DISCUSSION 

We have developed a method for embedding synthetic gene
networks into paper and other porous materials, creating a
much-needed path for moving synthetic biology out of the lab and
into the field. Here, we have constructed a number of
purpose-built synthetic gene networks responsive to synthetic RNA,
mRNA, and small molecules. This includes paper-based toehold
switches, which generate networks with greater than 20-fold
(Figures 2C and 3C) and as high as 350-fold induction
(Figure S2F). As demonstrated with the glucose sensor, this
paper-based approach also promises to convert existing constructs
designed for basic research and biotechnology into portable
and readily accessible molecular tools (Figures 4G and 4H).

The in vitro nature of these reactions affords many benefits.
For instance, without having to contend with maintaining the
plasmid over generations of cell division, complex genetically


(E)  Maximum  fold  change  during  the  first  90  min  of  the  color  output  rate  from  the  ampicillin  resistance  sensor  using  the  in-house  electronic  optical  reader  over  a 
titration  of  mRNA  concentrations. 

(F) Images of paper discs following a 2 hr incubation and fold-change measurement of constitutive paper-based GFP expression from a freeze-dried HeLa cell
extract.

(G) Schematic of the FRET-based mechanism used in the glucose nanosensor.

(H)  Using  an  average  of  the  data  collected  every  10  min  between  390  min  and  490  min,  the  528  nm  fluorescence  is  reduced  in  response  to  glucose  binding  to  the 
FRET-based  glucose  nanosensor  expressed  on  paper.  All  data  were  generated  from  freeze-dried,  cell-free  reactions  embedded  into  paper  with  their  respective 
gene  circuits. 

Error  bars  represent  SD. 




[Figure 5 image. Recoverable labels: 36 bp target regions; Ebola RNA sensors; Sudan strain sensors (SA-SL); Zaire strain sensors (ZA-ZL); Trigger RNA (- / +); Control reactions; Control reactions or untriggered RNA sensors; Triggered RNA sensors.]

Figure 5. Rapid Prototyping of Paper-Based RNA Sensors for Sequences from Sudan and Zaire Strains of the Ebola Virus

(A) Schematic of the generation of Ebola RNA sensors. Sensors with the same letter were targeted to identical windows in the Ebola nucleoprotein mRNAs of their
respective strains.

(B) Twenty-four toehold switch-based RNA sensors were constructed and tested in a 12 hr period. Based on the RNA segment windows (A-L), maximum fold
change during the first 90 min at 37°C is reported for both the Sudan and Zaire strains of the virus. Fold change rate is determined from the slope of absorbance at
570 nm over time (- control, + 3,000 nM RNA trigger).

(C) Composite image of the 240 paper-based reactions used to test the 24 sensors after data collection overnight. Control and untriggered toehold sensors
remain yellow, and activated toehold sensors have turned purple.

(legend continued on next page)
Figure 6. Paper-Based Converging Transcriptional Cascade

(A) Schematic of the genetically encoded components that convert transcription from E. coli RNAP into transcription from T3 RNAP and/or T7 RNAP. Expression
of these new RNAPs drives the transcription of previously dormant GFP constructs containing T3 or T7 promoters, and alternatively, they can drive expression of
a toehold switch and trigger pair under the regulation of T7 and T3, respectively.

(B) Images and fold-change measurements of paper-based T3 GFP expression with and without the T3 cascade module.

(C) Images and fold-change measurements of paper-based T7 GFP expression with and without the T7 cascade module.

(D) Images and fold-change measurements of paper-based T3/T7-dependent GFP expression with and without the T3 and T7 cascade modules. All data were
generated from freeze-dried, cell-free reactions embedded into paper with their respective GFP expression constructs; cascade modules were added as DNA
components at rehydration. Data were collected after an overnight incubation.

Error bars represent SD.


encoded networks can be assembled without the need for
coordinating selective antibiotic pressures. Moreover, paper-based
reactions can serve as a mock cell, allowing for linear PCR
products to be screened directly for function without sequencing,
assembly into circularized plasmids, or even gel purification of
fragments. We took advantage of these features in our paper
disc arrays to measure toehold orthogonality, as well as Ebola
sensor prototyping and complex circuit testing (Figures 2E and
2F, 5, 6, S2G, S2H, and S3B), which allowed us to rapidly screen
for gene circuit performance.

The paper-based cell extracts also offer a unique opportunity
for molecular biologists to work with difficult-to-culture
organisms. By generating cell extracts from engineered cell lines,
pathogens, or other specialized organisms (e.g., extremophiles
and symbionts), these reactions could serve as a proxy for
biology that is otherwise inaccessible to the broad research


(D) Sequence specificity tested for four Sudan and four Zaire sensors from the original set of 24. Each of the four sensors targeting Sudan sequences were treated
with 3,000 nM of off-target RNA sequence from the complementing Zaire RNA sequence and vice versa.

(E) Fold change of the color output rate of sensors SD and ZH over a titration of RNA concentrations. All data were generated from freeze-dried, cell-free reactions
embedded into paper. DNA containing the sensor and RNA triggers was added during rehydration.

Error  bars  represent  SD. 
community. Similarly, the incorporation of human cell extracts
into our paper-based scheme is an exciting feature that leads
to the prospect of diagnostic and synthetic biology applications
based on the thousands of human and mammalian transcription
factors (Hughes, 2011), such as nuclear receptors, with complex
and nuanced regulatory features, including
small-molecule-responsive regulation (Pardee et al., 2011).

Perhaps most exciting is the potential to bring the rational
design approach of synthetic biology to in vitro diagnostics
and sensing. Although competing antibody-based technologies
such as rapid diagnostic tests (RDTs) currently have greater
sensitivity, we expect advances in molecular techniques and
analyte concentration methods to solve these challenges.
Examples of existing technologies that could be implemented as part
of the platform include isothermal nucleic acid sequence-based
amplification of target RNA inputs (Yan et al., 2014) or
concentration of pathogen through generic opsonin-mediated capture
(Cooper et al., 2014; Kang et al., 2014). Moreover, rather than
competing with technically demanding and expensive lab-based
techniques like ELISA, PCR, and mass spectrometry, we
envision paper-based systems creating a new, low-cost generation
of sensors that could be embedded ubiquitously into daily life
(including clothing). Importantly, we were able to run toehold
switch reactions in a variety of porous materials, including cloth,
lab membranes, and porous alumina (data not shown).

Unlike antibody-based diagnostics, toehold switch mRNA
sensors offer a sequence-based method of detection that means
research and clinical tools can be designed rationally, lowering
development costs and allowing for significantly shorter
design-to-production cycles. Here, we presented toehold-based mRNA
sensors, including 24 Ebola sensors that were constructed in
less than 12 hr. The DNA input cost for the Ebola series was
$21 USD/sensor. These features compare highly favorably to
custom commercial antibody production, where development
time is typically 2 to 6 months and development costs range
from $4,000 to $30,000 USD. Further, sensors can be designed
from sequence information alone, which is ideal for emerging
pathogens (Figure 5), and the acquisition of other relevant clinical
information, such as the presence of drug-resistance genes
(Figures 4D and 4E) or other indicators of pathogenesis, such as
biofilm-specific RNAs (Dötsch et al., 2012). Coupled to an electronic
reader, these paper-based systems also offer the potential for
quantitative diagnostics (Figures 3I and 4E), a much-needed
feature that is largely unavailable with RDTs (Cordray and
Richards-Kortum, 2012). Moreover, the added sensitivity
(2.5-3.5×) of our purpose-built, electronic reader could be considered a
hardware enhancement of the designed gene circuits (Figures 4E
and S4A).

Paper-based synthetic gene networks are also potentially less
expensive to manufacture than most of the standard of care
options currently available. At the moment, the cost of a 1 µl
paper-based sensor would be between 35¢ and 65¢ using commercial
cell-free expression systems. However, these systems can be
readily produced in house, reducing the cost to as little as 2¢
to 4¢ per sensor (Noireaux, 2013). This compares to $0.45 to
$1.40 for a single RDT reaction and $1.50 to $4.00 (reagents
only) for PCR (Cordray and Richards-Kortum, 2012).
Transcription- and translation-based detection is also competitive with
regards to time to detection. As an example, detection of mRNA
from the ampicillin resistance gene was recorded as early as
25 min for high concentrations of mRNA and about 40 min for
our 3 nM treatment (Figures 4E and S4B). This compares
favorably to RDTs, which can detect a single antigen in ~20 min or
PCR, which can take at least 1 hr and is largely confined to
laboratory settings (Cordray and Richards-Kortum, 2012).

Our construction and testing of a converging transcription
cascade on freeze-dried paper discs (Figure 6) further
underscores the potential of this approach to host complex network
reactions for both laboratory and field applications. Although
perhaps not at first obvious, for each of the three transcription
cascade reactions to produce GFP (Figures 6B-6D), the system
had to essentially cycle through two cycles of the central dogma.
The first cycle converts the DNA of the cascade module(s) to
RNA and then protein (RNAP), which sets off the second cycle
that converts the DNA of dormant reporter to RNA and then
protein (GFP). Thus, despite their small size, the paper discs have
the transcription and translation capacity required for complex
tasks. For synthetic biology applications, such a transcription
cascade could be used to build more sophisticated networks
with layers of outputs that remain inactive unless triggered by
a circuit producing the correct RNAP. Further, as the
orthogonality (Figures 2E, S2G, S2H, and S3B) and Ebola screens
showed (Figure 5), the paper-based system has significant
potential to increase the pace at which genetically encoded tools
can be built and tested. With this in mind, we foresee this
technology being extended to prototyping of engineered metabolic
pathways and other complex gene circuits as a way to rapidly
vet combinatorial designs before moving to cellular hosts.
Thus, ready-to-use paper-based systems could not only make
tools currently only available in the laboratory readily fieldable
but also improve the development of new tools and the
accessibility of these molecular tools to educational programs for the
next generation of practitioners.

Taken together and considering the projected cost, reaction
time, ease of use, and no requirement for laboratory infrastructure,
we envision paper-based synthetic gene networks significantly
expanding the role of synthetic biology in the clinic, global
health, industry, research, and education.

EXPERIMENTAL  PROCEDURES 

TetR-Based  Inducible  Synthetic  Gene  Circuits 

The TetR-based GFP, mCherry, and chitinase expression circuits were
constructed by inserting GFP, mCherry, or chitinase downstream of the
TetR-repressed pLtetO promoter in pZE11 (Lutz and Bujard, 1997). To provide
constitutive TetR expression, tetR was cloned downstream of the constitutive
placIQ promoter, and this expression cassette was inserted into pZE11-gfp
and pZE11-mcherry using XhoI and AatII.

Toehold  Switch  Construction 

Toehold switch plasmids were constructed using conventional molecular
biology techniques. Synthetic DNA templates (Integrated DNA Technologies)
were amplified using PCR and inserted into plasmids using Gibson assembly
(Gibson et al., 2009) with 30 bp overlap regions, and successfully
constructed plasmids were then identified using DNA sequencing. All plasmids
were derived from pET system parent plasmids (EMD Millipore) and constitutively
express lacI and antibiotic resistance genes. Additional descriptions of the
toehold switch plasmids and their sequences are provided in Green et al. (2014).

Preparation of Matrix Materials

Although our initial paper-based reactions were successful, we suspected that
nonspecific interactions between the components of the cell-free system and
the cellulose matrix of paper could be impeding the activity of the reactions.
Cellulose holds a pH-dependent charge (Budd and Herrington, 1989), and
thus it seemed likely that components from the complex biomolecular systems
could adsorb to the high surface area of cellulose fibers, reducing reaction
efficiency. Accordingly, we treated the filter paper with bovine serum albumin
(BSA) and other commonly used blocking reagents. We found an overall
improvement in fluorescent output from treated paper, with 5% BSA yielding
the greatest increase (Figures S5A and S5B). For quartz microfiber, an
alternate substrate, the performance enhancement of blocking with the surfactant
Tween-20 was even more pronounced (Figures S5C and S5D).

Cell-Free Reactions

Cell-free reactions were assembled (4°C) as generally described in the
instructions of the respective manufacturers and then immediately incubated at 37°C.
Briefly, for S30 and S30 T7 reactions (Promega, L1020 and L1110), cell extracts
and premix containing amino acids were combined at a ratio of 0.33 and 0.51,
respectively. The volume was then brought up with RNase inhibitor (Roche;
0.005), plasmid DNA constructs comprising the gene circuits, and
nuclease-free ddH2O. For tetO-based gene circuits, reactions were supplemented
with prerun cell-free reactions expressing constitutive TetR (0.05) to reduce
promoter leakage. For the PT7 cell-free system (NEB, E6800L), solutions A
(0.4) and B (0.3) were combined together, with the remaining volume
composed of RNase inhibitor (Roche; 0.005), linear DNA constructs, and
nuclease-free ddH2O. HeLa cell cell-free systems (Thermo Scientific, 86881)
were assembled by combining HeLa cell lysate (0.5), accessory proteins
(0.1), and reaction mix (0.2) with RNase inhibitor (Roche; 0.005), plasmid
DNA constructs comprising the gene circuits, and nuclease-free ddH2O.
Long-term storage of freeze-dried pellets was done protected from light, under
nitrogen gas, and in the presence of silica gel desiccation packages. For
colorimetric reactions, the enzymatic substrates chlorophenol red-β-D-galactopyranoside
and 4-Nitrophenyl N,N'-diacetyl-β-D-chitobioside were supplied to
the reactions prior to freeze drying at final concentrations of 0.6 mg/ml and
3 mg/ml, respectively.
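
The volume fractions quoted above can be turned into pipetting volumes for any total reaction volume. The following Python sketch is only an illustration: the dictionary keys, the function name component_volumes, and the reading of 0.33 and 0.51 as the extract and premix fractions of an S30 reaction are assumptions made for this example, and the remainder is treated as the volume left for DNA constructs and nuclease-free water.

```python
# Volume fractions as quoted in the text; key names are illustrative only.
FRACTIONS = {
    "S30": {"cell extract": 0.33, "premix": 0.51, "RNase inhibitor": 0.005},
    "PT7": {"solution A": 0.4, "solution B": 0.3, "RNase inhibitor": 0.005},
    "HeLa": {"lysate": 0.5, "accessory proteins": 0.1,
             "reaction mix": 0.2, "RNase inhibitor": 0.005},
}


def component_volumes(system, total_ul, extra=None):
    """Return the volume (in µl) of each component for a given total volume.

    `extra` allows optional supplements, e.g. the prerun TetR reaction
    (fraction 0.05) used for tetO-based circuits.
    """
    fractions = dict(FRACTIONS[system])
    if extra:
        fractions.update(extra)
    used = sum(fractions.values())
    if used > 1.0:
        raise ValueError("component fractions exceed the total volume")
    volumes = {name: frac * total_ul for name, frac in fractions.items()}
    # Whatever is left is available for DNA constructs and water.
    volumes["DNA + water"] = (1.0 - used) * total_ul
    return volumes


if __name__ == "__main__":
    for name, vol in component_volumes("PT7", 10.0).items():
        print(f"{name}: {vol:.2f} µl")
```

For a tetO-based circuit, the supplement could be passed as component_volumes("S30", 50.0, extra={"prerun TetR reaction": 0.05}).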

The concentrations of the gene circuits were as follows: Figures 1B and 10,
T7_GFP plasmid DNA 5 ng/µl (pBR939b_T7_GFP; Anderson et al., 2007);
Figure 1E, PN25-GFP plasmid DNA 30 ng/µl and T7_GFP plasmid DNA 30 ng/µl;
Figure 1G, TetO-GFP and TetO-mCherry plasmid DNA 30 ng/µl. Toehold
switch and sensor sequences are available as a supplemental information
file (Data S2); Figure 2, linear DNA 33 nM, RNA triggers 5 µM; Figure 3, linear
DNA 33 nM, RNA triggers 5 µM or as specified; Figures 4B and 4C, mCherry
and GFP mRNA sensors linear DNA 33 nM, respective mRNAs 2.5 µM; Figures
4D and 4E, antibiotic resistance gene mRNA sensors for spectinomycin,
chloramphenicol, kanamycin, and ampicillin linear DNA 33 nM, respective mRNAs
3 µM or as specified; Figure 4F, pT7CFE1_GFP plasmid DNA (Thermo
Scientific) 30 ng/µl, pcDNA3.1 FLIPglu-30uDelta13V plasmid DNA (Takanaga and
Frommer, 2010; Addgene:18015) 30 ng/µl; Figure 5, PCR product was not
gel purified, so a linear DNA concentration of 150 ng/µl was used to ensure
adequate sensor product, respective trigger RNAs 3 µM or as specified;
Figure 6, T3 and T7 cascade modules plasmid DNA 30 ng/µl (E. coli
RNAP_trigger_1, E. coli RNAP_switch_1_T3RNAP, E. coli RNAP_trigger_2, E. coli
RNAP_switch_2_T7RNAP), T3_GFP and T7_GFP plasmid DNA 40 ng/µl, T3
RNAP_trigger_3, and T7 RNAP_switch_3_GFP plasmid DNA 40 ng/µl.

Preparation of Reactions and Incubation

Assembled cell-free reactions were applied (1.6 µl/disc) to 2 mm paper discs
(Whatman, 1442-042), which were then flash frozen in liquid nitrogen and
freeze dried overnight. Paper discs were cut using a 2 mm biopsy punch.
Similarly, quartz microfiber (Spectrum, 864-66171) was cut and treated with
cell-free reactions (3 µl/disc) prior to freeze drying. Freeze-dried solution phase
reactions (7 µl) were similarly flash frozen and placed on the lyophilizer to
create pellets. After 24 hr, paper reactions for Figures 1, 2, 3, and 4 were
rehydrated with either nuclease-free ddH2O or inducer at the concentrations
specified. For these reactions, DNA for gene circuits was added to paper before
freeze drying. For Figure 5 reactions, DNA containing the sensor and RNA
triggers were both added during rehydration. For Figure 6 reactions, DNA for the
GFP reporters was added prior to freeze drying (T3_GFP, T7_GFP, or
T3/T7_GFP). Rehydration of the paper discs was then done in the presence or
absence of DNA constructs for the T3 and/or T7 cascade modules. Cell-free
reactions without synthetic gene networks were also included to provide the
background signal for subtraction from control and treatment reactions.
Rehydrated reactions were incubated at 37°C using either a plate reader (BioTek
NEO HTS) or our in-house electronic optical reader placed inside a tissue
culture incubator. For the plate reader, paper discs were placed into black,
clear-bottom 384-well plates, and for the in-house reader, paper discs were placed
into 2 mm holes in the removable acrylic chip (Figures 3H and S3H). Cell-based
reactions presented in Figure S2C were performed in E. coli as described by
Green et al. (2014). Briefly, log-phase cultures containing a toehold switch
construct and either an on-target or off-target IPTG-inducible RNA trigger
construct were diluted 1:100. Cells were grown at 37°C overnight in the
presence or absence of 1 mM IPTG, and GFP expression was measured using a
plate reader.

Microscopy  and  Image  Processing 

Images of paper discs were collected on a Zeiss Axio Zoom V16 Macroscope
(magnification 7x) with an AxioCam MRm in a humidified glass chamber or
through the bottom of a clear-bottom 384-well plate. Collected images were
then stitched together using Zeiss Zen software into large composite images
for further processing in ImageJ. Experiments were arranged so that images
of matching control and treatment paper-based reactions were collected
together such that parameters could be adjusted for all samples
simultaneously. Once optimized, images of individual paper discs were cropped
and arrayed into figures. For GFP expression in the S30 cell-free system, which
exhibits a high level of autofluorescence, we used a Nuance camera to collect
multispectral images between 500 and 620 nm. Perkin Elmer Nuance 3.0.2
software was then used to unmix the spectral signature of the GFP from that
of the cell extract. A similar approach was used to create an absorbance
signature (420 to 720 nm) for imaged paper discs with p-nitrophenol, the chitinase
cleavage product of 4-Nitrophenyl N,N'-diacetyl-β-D-chitobioside. Images
collected with the Nuance camera were scaled using bilinear transformation.
For a few composite images, areas around the paper discs were masked to
remove extraneous signal.

Printed  Arrays 

Patterns for printed arrays were generated using Adobe Illustrator and then
printed onto chromatography paper (Whatman, 3001-861) using a Xerox
8570 printer (Carrilho et al., 2009). Once printed, the wax was reflowed using
a hot plate (120°C) so that the wax was present throughout the entire thickness
of the paper, creating hydrophobic barriers to contain each reaction. The
printing was performed at the Center for Nanoscale Systems at Harvard University,
a member of the National Nanotechnology Infrastructure Network (NNIN),
which is supported by the National Science Foundation under NSF award
ECS-0335765.

Electronic  Optical  Reader 

The portable device consists of four layers housed within a laser-cut acrylic
box (Figure 3H). The top layer holds 12 LEDs (Digi-Key, 754-1262-ND), which
have a very narrow viewing angle and an emission of 570 nm to match the
absorbance maximum of the product of the LacZ reaction. The LEDs were
placed in close proximity to the chip in the middle layer, which holds 12 paper
discs within 2 mm apertures. The apertures prevented transmission of stray
light and were coaxial with the LEDs in the top layer and the array of 12
TSL2561 sensors (Adafruit, 439) in the third layer below. The bottom layer
contains the Arduino Micro and associated electronics such as the multiplexers
(Sparkfun, BOB-09056), breadboard, resistors, and connectors. To prevent
crosstalk between reads, reactions were read in series by sequentially
activating each LED and sensor pair. The read frequency and pattern of the reader
can be easily adjusted by modifying and uploading alternative sketches to the
Arduino Micro. Both the raw data and the data processed using the per disc
formula 100 - (100 x (Current/Max)) were calculated for each time point and
sent to a laptop. A diagram of the circuit and an overview of the laser-cut parts
can be found in the supplemental figures (Figures S3G and S3H), and laser
cutting patterns and Arduino sketch files are provided as supplemental
information files (Data S1).
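
The per disc formula above can be sketched in a few lines of Python. This is an illustration only, not the Arduino sketch supplied with the paper: the function names are hypothetical, and "Max" is assumed here to be the largest raw light reading recorded for that disc, which the text does not define.

```python
def percent_signal(current, maximum):
    """Apply the per disc formula 100 - (100 x (Current/Max))."""
    if maximum <= 0:
        raise ValueError("maximum reading must be positive")
    return 100.0 - 100.0 * (current / maximum)


def process_series(raw_reads):
    """Convert one disc's raw sensor reads into processed values.

    Assumption: Max is the largest raw reading seen for this disc, so a
    drop in transmitted light (more colored product) gives a larger value.
    """
    maximum = max(raw_reads)
    return [percent_signal(r, maximum) for r in raw_reads]


if __name__ == "__main__":
    print(process_series([200, 150, 100]))
```

Under this assumption a disc whose transmitted light falls to half of its maximum is reported as 50.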

Calculation  of  Fold  Change 

Fluorescence and absorbance data were first smoothed to reduce
measurement noise using a moving three-point average of the time point and the
data both preceding and following that read. Background signal was
subtracted from each well using the average of control blank reactions that
contained the relevant cell-free system alone. The minimum value of each well
was then adjusted to zero. For fluorescence data, fold change calculations
were done by dividing the wells at each time point by the average signal
from the corresponding uninduced control wells. For absorbance
measurements, fold change values reflect the difference in the rate of color change
between induced and uninduced wells. This was done by calculating the rate of
change using slope; therefore, at each 10 min time point, the rate reported was
calculated as follows: S_n = (T_{n+1} - T_n)/10, where T is the normalized data at a
time point (T_n) and the time point 10 min later (T_{n+1}), and S_n is the slope
reported for T_n. Fold change was then calculated as with the fluorescence
data. Experiments were run in either triplicate or quadruplicate. Due to high
autofluorescence of the S30 reactions, a few experiments exhibited high
variability and were accordingly subjected to the Modified Thompson Tau Test
to identify outliers to be discarded (Anbarasi et al., 2011; Byrd et al., 2014).
Briefly, the Tau test was performed by calculating the difference (delta)
between a measured value (replicate) and the mean of the group. Using the
Modified Thompson Tau value for quadruplicate data (1.4250), the test was
performed by multiplying the SD of the group by this Tau value. If the resulting
number was smaller than the delta calculated for a measured value, that
replicate was considered an outlier and was not included in the analysis.
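
The processing steps described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions rather than the authors' analysis code: the function names are hypothetical, the end points of each series are left unsmoothed because the text does not say how they were handled, and the SD is taken to be the sample standard deviation.

```python
from statistics import mean, stdev

TAU_QUADRUPLICATE = 1.4250  # Modified Thompson Tau value quoted for n = 4


def smooth(values):
    """Three-point moving average; end points are left unchanged (assumption)."""
    out = list(values)
    for i in range(1, len(values) - 1):
        out[i] = (values[i - 1] + values[i] + values[i + 1]) / 3.0
    return out


def slopes(normalized, interval_min=10.0):
    """S_n = (T_{n+1} - T_n) / 10 for consecutive time points."""
    return [(b - a) / interval_min for a, b in zip(normalized, normalized[1:])]


def fold_change(induced, uninduced_wells):
    """Divide an induced well by the mean uninduced signal at each time point."""
    return [x / mean(ctrl) for x, ctrl in zip(induced, zip(*uninduced_wells))]


def tau_outliers(replicates, tau=TAU_QUADRUPLICATE):
    """Flag replicates whose delta from the group mean exceeds tau * SD."""
    group_mean = mean(replicates)
    threshold = tau * stdev(replicates)  # sample SD (assumption)
    return [abs(x - group_mean) > threshold for x in replicates]


if __name__ == "__main__":
    print(smooth([0, 3, 6, 9]))
    print(slopes([0, 5, 15]))
    print(tau_outliers([1.0, 1.1, 0.9, 5.0]))
```

Note that fold_change will fail if the uninduced mean is exactly zero at a time point, which can occur after the minimum of each well is shifted to zero; how such points were treated is not stated in the text.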

SUPPLEMENTAL  INFORMATION 

Supplemental Information includes Extended Experimental Procedures, five
figures, and two data files and can be found with this article online at
http://dx.doi.org/10.1016/j.cell.2014.10.004.

SUMMARY

Efforts to construct synthetic networks in living
cells have been hindered by the limited number of
regulatory components that provide wide dynamic
range and low crosstalk. Here, we report a class of
de-novo-designed prokaryotic riboregulators called
toehold switches that activate gene expression in
response to cognate RNAs with arbitrary sequences.
Toehold switches provide a high level of orthogonality
and can be forward engineered to provide
average dynamic range above 400. We show that
switches can be integrated into the genome to regulate
endogenous genes and use them as sensors that
respond to endogenous RNAs. We exploit the orthogonality
of toehold switches to regulate 12 genes
independently and to construct a genetic circuit that
evaluates 4-input AND logic. Toehold switches, with
their wide dynamic range, orthogonality, and
programmability, represent a versatile and powerful
platform for regulation of translation, offering diverse
applications in molecular biology, synthetic biology,
and biotechnology.

INTRODUCTION 

Synthetic biology seeks to apply engineering design principles to
predict and control the behavior of living systems. Synthetic
gene networks have been used to construct a wide range of
biological devices, including molecular counters (Friedland et al.,
2009), oscillators (Elowitz and Leibler, 2000), toggle switches
(Gardner et al., 2000), logic gates (Ausländer et al., 2012; Moon
et al., 2012; Siuti et al., 2013; Win and Smolke, 2008), cell
classifiers (Xie et al., 2011), and analog signal processors (Daniel et al.,
2013). Despite these developments, an underlying problem in
synthetic biology remains the limited number of composable,
high-performance parts for constructing genetic circuits and
difficulties that arise when integrating multiple components into a
large, complex synthetic network (Purnick and Weiss, 2009).
Unlike electronic circuit elements, which can be electrically and
spatially insulated from each other, biological components can
interact with one another in the complex cellular environment
and suffer from unwanted crosstalk between components.
Limitations imposed by the number and orthogonality of available
biological circuit components hinder the construction of more
complex circuits that can operate robustly in living cells.
Consequently, new classes of regulatory components that offer
wide dynamic range, low system crosstalk, and design flexibility
represent a much-needed, enabling step toward fully realizing
the potential of synthetic biology in areas such as biotechnology
and medicine (Khalil and Collins, 2010).

RNA-based regulatory elements offer a potential solution to this
component bottleneck. Biological parts constructed from RNA
take advantage of predictable Watson-Crick base pairing to
control cellular behavior and can harness sophisticated software
tools for predicting RNA-RNA interactions. Nature has already
developed a wide assortment of RNA-based parts operating at
the transcriptional and posttranscriptional level (Brantl and
Wagner, 2000; Gultyaev et al., 1997; Winkler et al., 2002). Starting from
these natural systems, researchers have developed a number of
engineered RNA regulatory elements, including riboregulators
that control translation and transcription in response to cognate
RNAs (Callura et al., 2010, 2012; Isaacs et al., 2004; Lucks
et al., 2011; Mutalik et al., 2012; Rodrigo et al., 2012), and
mRNA transducers that convert signals from small-molecule or
protein ligands to protein outputs (Bayer and Smolke, 2005; Culler
et al., 2010; Qi et al., 2012; Win and Smolke, 2008).

Engineered riboregulators consist of cognate pairs of RNAs: a
transducer strand that regulates translation or transcription and
a trans-acting RNA that binds to the transducer to modulate its
biological activity. Riboregulator designs can be classified
according to the initial RNA-RNA interaction that drives hybridization
between the transducer and trans-acting RNAs. Reactions
initiated between loop sequences in both RNAs are termed
loop-loop interactions, whereas those that occur between a
loop sequence and an unstructured RNA are termed loop-linear
(Takahashi and Lucks, 2013).

A common limitation for riboregulators has been their dynamic
range (Liu et al., 2012). Previous prokaryotic translational
riboregulators have typically modulated biological signals by up to a
maximum of ~55-fold for activators (Callura et al., 2012) and
up to ~10-fold for repressors (Mutalik et al., 2012). In contrast,
protein-based transcriptional regulators have demonstrated
dynamic ranges over an order of magnitude higher, with

Figure 1. Toehold Switch Design and In Vivo Characterization

(A and B) Design schematics of conventional riboregulators (A) and toehold switches (B). Variable sequences are shown in gray, whereas conserved or
constrained sequences are represented by different colors.

(A) Conventional riboregulator systems repress translation by base pairing directly to the RBS region. RNA-RNA interactions are initiated via a loop-linear or
loop-loop interaction at the YUNR loop in an RNA hairpin.

(B) Toehold switches repress translation through base pairs programmed before and after the start codon (AUG), leaving the RBS and start codon regions
completely unpaired. RNA-RNA interactions are initiated via linear-linear interaction domains called toeholds. The toehold domain a binds to a complementary a*
domain on the trigger RNA. Domains a and b are 12 and 18 nts, respectively.

(C) Flow cytometry GFP fluorescence histograms for toehold switch number 2 compared to E. coli autofluorescence and a positive control. Autofluorescence
level measured from induced cells not bearing a GFP-expressing plasmid.

(D) GFP mode fluorescence levels measured for switches in their ON and OFF states in comparison to positive control constructs and autofluorescence. Error
bars are the SD from at least three biological replicates.


widely used inducible promoters regulating protein expression
over 350-fold (Lutz and Bujard, 1997) and sigma factor-promoter
pairs providing up to 480-fold modulation (Rhodius et al., 2013).
Despite the inherent programmability of RNA-based systems,
efforts at constructing large sets of orthogonal riboregulators have
been limited to libraries of at most seven parts with crosstalk
levels of ~20% (Takahashi and Lucks, 2013). Typical RNA-based
regulators employ interaction domains consisting of
~30 nts, which corresponds to a sequence space of over 10^18
potential regulatory elements. Thus, the sheer diversity of
possible RNA-based parts suggests that previous devices
have not come close to realizing the potential of highly
orthogonal regulation.
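
The sequence-space figure quoted above follows from counting all RNA sequences of about 30 nt over a four-letter alphabet. A short Python check, with variable names chosen only for this illustration:

```python
INTERACTION_NT = 30   # approximate interaction domain length from the text
ALPHABET_SIZE = 4     # A, C, G, U

sequence_space = ALPHABET_SIZE ** INTERACTION_NT
print(sequence_space)             # 1152921504606846976
print(sequence_space > 10 ** 18)  # True
```

This counts every possible sequence; the number of practically usable elements is smaller once design constraints are applied.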

Much of this discrepancy arises from the significant sequence
constraints imposed on riboregulators engineered thus far
(Figure 1A). Like natural riboregulators, engineered riboregulators
of translation have invariably used base pairing to the ribosome
binding site (RBS) to prevent ribosome binding, thereby
preventing translation (Callura et al., 2012; Isaacs et al., 2004; Mutalik
et al., 2012; Rodrigo et al., 2012). Because repression is caused
by RBS binding, trigger RNAs that activate translation are
engineered to contain an RBS sequence to displace the repressing
sequence, which in turn reduces the potential sequence space
for the riboregulator.

Previous riboregulators have also relied on U-turn loop
structures to drive loop-loop and loop-linear interactions between
RNAs (Figure 1A) (Callura et al., 2012; Isaacs et al., 2004; Lucks
et al., 2011; Takahashi and Lucks, 2013). U-turn loops are
common RNA structural motifs formed by tertiary interactions that
have been identified in ribozymes, ribosomal RNAs, and transfer
RNA anticodon loops (Gutell et al., 2000). Although recent work
has begun to show that loops with canonical U-turn sequences
are not essential for riboregulators (Mutalik et al., 2012; Rodrigo
et al., 2012), the engineered systems reported to date have
continued their reliance on the loop-mediated RNA interactions
from natural systems. Although these loop interactions have
been selected by evolution in nature, alternative approaches
employing linear-linear RNA interactions are amenable to rational
engineering and exhibit more favorable reaction kinetics and
thermodynamics, factors that could be exploited to increase
riboregulator dynamic range.

To address these fundamental limitations, we designed a
class of de-novo-designed riboregulators that enable
posttranscriptional activation of protein translation through mechanisms
employed in artificial systems rather than natural ones. Based on
their interaction mechanism and near-digital signal processing
behavior, we term these riboregulator systems toehold switches.
Unlike conventional riboregulators, our synthetic riboregulators
take advantage of toehold-mediated linear-linear interactions
developed in vitro (Dirks and Pierce, 2004; Yin et al., 2008; Yurke
et al., 2000) to initiate RNA-RNA strand displacement
interactions. Furthermore, they rely on sequestration of the region
around the start codon to repress protein translation, eschewing
any base pairing to the RBS or start codon itself to regulate
translation. As a result, toehold switches can be designed to activate
protein translation in response to a trigger RNA with an arbitrary
sequence, enabling substantial improvements in component
orthogonality. The absence of binding to the RBS and use of
thermodynamically favorable linear-linear interactions also
enable facile tuning of translational efficiency via RBS
engineering. Consequently, these systems routinely enable modulation
of protein expression over two orders of magnitude.

Here, we demonstrate the utility of toehold switches by
validating more than 100 functional systems in E. coli. We exploit
the expanded RNA sequence space afforded by the new
riboregulator class to construct libraries of components with
unprecedented orthogonality, including a set of 26 systems that exhibit
less than 12% crosstalk. Heuristic design principles are used to
generate forward-engineered switches that exhibit average
ON/OFF ratios exceeding 400, a dynamic range typically reserved
for protein-based transcriptional regulators. We also use toehold
switches to sense endogenous RNAs in vivo and to regulate
endogenous gene expression by integrating them into the
genome. For potential applications in synthetic biology, we
demonstrate that toehold switches can be used to regulate a
dozen components in the cell at the same time and to construct
a genetic circuit that computes a 4-input AND expression. The
versatility, dynamic range, orthogonality, and programmability of
toehold switches provide them with the potential to become a
powerful platform for sensing and programming the internal states
of living cells. We anticipate that these devices will be enabling
tools for constructing increasingly complex genetic circuits in
synthetic biology.

RESULTS 

Design  of  a  Biological  Device  from  First  Principles 

A riboregulator that activates gene expression must switch from
a secondary structure that prevents translation to a configuration
that promotes translation upon binding of a cognate
trans-acting RNA. We developed toehold switches based on
previous reports describing the influence of secondary structure
on mRNA translation and knowledge of RNA thermodynamics
and kinetics.

Although the Shine-Dalgarno sequence is an important factor
in determining the efficiency of translation from a given mRNA,
studies have found that secondary structure in regions near
the start codon also plays a critical role (Kudla et al., 2009).
Furthermore, genome-wide analyses have revealed strong
biases toward low secondary structures around the start codon
of mRNAs from a panel of hundreds of bacterial genomes
(Bentele et al., 2013). Thus, we ensured that toehold switches
sequestered the region around the start codon to repress
translation, rather than binding to either the RBS or the start codon.


(E) ON/OFF GFP fluorescence levels obtained 3 hr after induction for 168 first-generation toehold switches. Inset: ON/OFF GFP fluorescence measured for
toehold switches of varying performance levels at different time points following induction. Relative errors for the switch ON/OFF ratios were obtained by adding
the relative errors of the switch ON and OFF state fluorescence measurements in quadrature. Relative errors for ON and OFF states are from the SD of at least
three biological replicates.

See also Figure S1 and Table S1.
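
The error propagation described in the legend, adding the relative errors of the ON and OFF measurements in quadrature, can be written out as a short Python sketch. The function and argument names are hypothetical and the numbers in the usage line are made up for illustration; they are not data from the paper.

```python
import math


def on_off_ratio(on, sd_on, off, sd_off):
    """Return the ON/OFF ratio and its relative error.

    The relative error is the quadrature sum of the relative errors
    (SD divided by mean) of the ON and OFF state measurements.
    """
    ratio = on / off
    relative_error = math.sqrt((sd_on / on) ** 2 + (sd_off / off) ** 2)
    return ratio, relative_error


if __name__ == "__main__":
    ratio, rel = on_off_ratio(on=100.0, sd_on=3.0, off=2.0, sd_off=0.08)
    print(f"ON/OFF = {ratio:.1f} +/- {ratio * rel:.1f}")
```
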
Instead of using loop regions to initiate interactions, we
recognized the design advantages afforded by linear-linear nucleic
acid interaction strategies developed in vitro and focused in
particular on toehold-based strand displacement reactions
(Yurke et al., 2000). In these systems, interactions between
reaction species are kinetically controlled through hairpins or
multistranded complexes that feature exposed single-stranded
domains called toeholds. These domains serve as reaction
initiation sites for input nucleic acid species and do not require
U-turn structural motifs to increase their accessibility. Such
toehold mechanisms have been extensively used to engineer
complex dynamic systems utilizing reconfigurable DNA hairpins
in test tubes (Dirks and Pierce, 2004; Yin et al., 2008) and RNA
hairpins in fixed tissues (Choi et al., 2010) and for complex
in vitro information processing systems (Qian and Winfree,
2011; Qian et al., 2011; Yin et al., 2008).

Toehold switch systems are composed of two RNA strands
referred to as the switch and trigger (Figure 1B). The switch
RNA contains the coding sequence of the gene being regulated.
Upstream of this coding sequence is a hairpin-based processing
module containing both a strong RBS and a start codon that is
followed by a common 21 nt linker sequence coding for
low-molecular-weight amino acids added to the N terminus of the gene
of interest. A single-stranded toehold sequence at the 5' end of
the hairpin module provides the initial binding site for the trigger
RNA strand. This trigger molecule contains an extended
single-stranded region that completes a branch migration process
with the hairpin to expose the RBS and start codon, thereby
initiating translation of the gene of interest.

The hairpin processing unit functions as a repressor of translation
in the absence of the trigger strand. Unlike previous riboregulators,
the RBS sequence is left completely unpaired within
the 11 nt loop of the hairpin. Instead, the bases immediately
before and after the start codon are sequestered within RNA duplexes
that are 6 bp and 9 bp long, respectively. The start codon
itself is left unpaired in the switches we tested, leaving a 3 nt
bulge near the midpoint of the 18 nt hairpin stem. Because the
repressing domain b (Figure 1B) does not possess complementary
bases to the start codon, the cognate trigger strand in turn
does not need to contain corresponding start codon bases,
thereby increasing the number of potential trigger sequences.
The sequence in the hairpin added after the start codon was
also screened for the presence of stop codons, as they would
prematurely terminate translation of the gene of interest when
the riboregulator was activated. We employed a 12 nt toehold
domain at the 5' end of the hairpin to initiate its interaction with
the cognate trigger strand. The trigger RNA contains a 30 nt
single-stranded RNA sequence that is complementary to the
toehold and stem of the switch RNA.
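
The stop-codon screen mentioned above can be illustrated with a short sketch. This is only an illustration under stated assumptions, not the authors' design pipeline: the function name `has_in_frame_stop`, its arguments, and the example sequences are hypothetical, and the caller is assumed to know the index of the start codon.

```python
# Hypothetical helper: flag hairpin/linker sequences that contain a stop codon
# in frame with the start codon. Names and sequences are illustrative only.
STOP_CODONS = {"UAA", "UAG", "UGA"}

def has_in_frame_stop(rna, start_index):
    """Return True if a stop codon occurs in frame after the AUG at start_index."""
    rna = rna.upper().replace("T", "U")  # accept DNA-style input as well
    if rna[start_index:start_index + 3] != "AUG":
        raise ValueError("no AUG start codon at the given index")
    # Walk codon by codon through the sequence that follows the start codon.
    for i in range(start_index + 3, len(rna) - 2, 3):
        if rna[i:i + 3] in STOP_CODONS:
            return True
    return False

# Toy sequences (not from the paper): the first has UAA in frame,
# the second contains UAA only out of frame.
print(has_in_frame_stop("GGAUGAAAUAAGG", 2))
print(has_in_frame_stop("GGAUGAAACUAAGG", 2))
```

A candidate for which this check returns True would be discarded, because translation of the output gene would terminate inside the linker.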

In Silico Design of Toehold Switches

We used the NUPACK nucleic acid sequence design package
(Zadeh et al., 2011) to generate libraries of de-novo-designed
translational activators satisfying the desired device parameters
(see section S6 of the Extended Experimental Procedures and
Figure S1C available online). We initially designed a set of 24
toehold switches (see Table S1 for RNA sequences) to gauge
in vivo performance.


We then designed an extended library of toehold switches
containing orthogonal elements selected for low crosstalk with
the rest of the library. The orthogonal library was generated by
assembling a set of 646 unique toehold switch designs and
simulating all 417,316 pairwise interactions in the set to evaluate
crosstalk interactions (see section S6.4 of the Extended
Experimental Procedures and Figure S1D). A Monte Carlo algorithm
was used to identify a subset of 144 switches that exhibited
the lowest level of overall component crosstalk (see Table S1
for RNA sequences). The resulting library of orthogonal regulators
provided a large set of components to independently regulate
translation in vivo and comprised devices well separated
from one another in the sequence space to provide valuable
information regarding sequence-dependent effects.
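
The subset selection can be sketched as a simple swap-based Monte Carlo search. The actual algorithm is described in section S6.4 of the Extended Experimental Procedures and is not reproduced here, so everything below is an assumption made for illustration: the score matrix `matrix[i][j]` (a hypothetical predicted crosstalk value between switch i and trigger j), the Metropolis acceptance rule, the fixed `temperature`, and all function names.

```python
import math
import random

def subset_crosstalk(matrix, subset):
    """Total off-diagonal crosstalk among the chosen designs."""
    return sum(matrix[i][j] for i in subset for j in subset if i != j)

def select_orthogonal_subset(matrix, k, steps=20000, temperature=0.05, seed=0):
    """Search for k designs with low total pairwise crosstalk (illustrative only)."""
    rng = random.Random(seed)
    n = len(matrix)
    chosen = rng.sample(range(n), k)
    pool = [i for i in range(n) if i not in chosen]
    score = subset_crosstalk(matrix, chosen)
    best, best_score = list(chosen), score
    for _ in range(steps):
        if not pool:
            break
        a, b = rng.randrange(k), rng.randrange(len(pool))
        chosen[a], pool[b] = pool[b], chosen[a]        # propose a swap
        new_score = subset_crosstalk(matrix, chosen)
        delta = new_score - score
        # Metropolis rule: always accept improvements, sometimes accept worse moves.
        if delta <= 0 or rng.random() < math.exp(-delta / temperature):
            score = new_score
            if score < best_score:
                best, best_score = list(chosen), score
        else:
            chosen[a], pool[b] = pool[b], chosen[a]    # reject: undo the swap
    return sorted(best), best_score

# Toy 8-design example: designs 0-3 are mutually orthogonal, the rest are not.
toy = [[0 if (i < 4 and j < 4) else 1 for j in range(8)] for i in range(8)]
print(select_orthogonal_subset(toy, 4, steps=2000))
```

For the library in the text, `matrix` would be the 646 x 646 table of simulated interactions and `k` would be 144; recomputing the full subset score at every step is wasteful at that size, and an incremental update would be the obvious refinement.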

In  Vivo  Component  Validation 

The toehold switches were tested in E. coli BL21 Star DE3 with
the switch and trigger RNAs expressed from separate medium
and high copy plasmids, respectively. Although BL21 Star DE3
is an RNase-deficient strain, we found that toehold switches
also performed well in strains with wild-type RNase levels (see
section S10 of the Extended Experimental Procedures,
Figure S2C, and Table S3). Expression of both strands was induced
using IPTG, which triggered production of both RNA species
through T7 RNA polymerase. GFP was used to characterize
switch output performance via flow cytometry.

Representative flow cytometry histograms of GFP output
from toehold switch number 2 are shown in Figure 1C. ON state
fluorescence measured from cells expressing both the switch
and its cognate trigger was near the fluorescence intensity of
control constructs with unrepressed GFP expression. OFF state
fluorescence from cells expressing the switch and a noncognate
trigger revealed a slight increase in GFP levels over E. coli
autofluorescence. The mode fluorescence value from these histograms
was used to calculate the ON/OFF ratios of each
toehold switch design (see section S3 of the Extended
Experimental Procedures and Figure S1B). Cell autofluorescence
was not subtracted from either the ON or OFF state fluorescence
for determination of ON/OFF ratios.
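
As a worked illustration of the ON/OFF calculation, the sketch below combines replicate mode-fluorescence values into a ratio and propagates the relative errors in quadrature, as stated in the figure captions. The function name and the numbers are hypothetical, and the use of the sample standard deviation across replicates is an assumption.

```python
import math
from statistics import mean, stdev

def on_off_ratio(on_values, off_values):
    """Return (ratio, absolute error) from replicate mode-fluorescence values."""
    on_mean, off_mean = mean(on_values), mean(off_values)
    ratio = on_mean / off_mean
    rel_on = stdev(on_values) / on_mean      # relative error of the ON state
    rel_off = stdev(off_values) / off_mean   # relative error of the OFF state
    rel_ratio = math.sqrt(rel_on ** 2 + rel_off ** 2)  # added in quadrature
    return ratio, ratio * rel_ratio

# Invented replicate values (arbitrary units), no autofluorescence subtracted.
ratio, err = on_off_ratio([900, 1000, 1100], [9, 10, 11])
print(f"ON/OFF = {ratio:.0f} +/- {err:.0f}")
```

Because autofluorescence is not subtracted from either state, ratios computed this way are conservative: the OFF-state denominator includes the cell background.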

Figure 1D presents the mode GFP fluorescence obtained from
three high-performance toehold switches in their ON and OFF
states. The OFF state fluorescence of the three switches was
near the background autofluorescence levels measured for
induced cells not expressing GFP. The ON state fluorescence
for the toehold switches was comparable to the fluorescence
obtained from positive controls that recapitulated the secondary
structure of the toehold switch in its activated state (see section
S7 of the Extended Experimental Procedures, Figure S1E, and
Table S1). The activated toehold switches all showed higher
GFP output than a standard GFP expression construct featuring
a completely single-stranded region upstream of the RBS and
gene, likely as a result of the increased mRNA stability afforded
by the secondary structures of the switch RNA and the
trigger/switch RNA complex.

Of the 168 first-generation systems tested, 20 exhibit ON/OFF
ratios exceeding 100, and nearly two thirds display an
ON/OFF ratio greater than 10 (Figure 1E). In comparison, we
also characterized the widely used engineered riboregulators
Figure 2. Assessment of Toehold Switch Orthogonality

(A) GFP fluorescence from colonies of E. coli expressing 676 pairwise combinations of switch and trigger RNAs. GFP-expressing colonies are visible as green
points along the diagonal in cells containing cognate switch and trigger strands. Off-diagonal, noncognate components have low fluorescence.

(B) Crosstalk measured by flow cytometry for all trigger-switch combinations. Crosstalk was determined by taking GFP output measured for a given
trigger-switch combination and dividing it by the GFP output measured for the switch with its cognate trigger.

(C) Comparison of overall library dynamic range and orthogonal library size for the toehold switches and previous riboregulators. The overall library dynamic range
corresponds to the minimum ON/OFF ratio to expect in a network employing this library of switches.

See  also  Table  S2. 


(Isaacs et al., 2004) crRNA 10 and 12 in identical conditions.
These earlier riboregulators exhibited lower dynamic range
with ON/OFF values of 11 ± 2 and 13 ± 4 for crRNA systems
10 and 12, respectively (see section S8 of the Extended
Experimental Procedures and Figure S1F). Time-course measurements
revealed system activation within 1 hr of induction and
ON/OFF ratios increased over time with continued production
of GFP (Figure 1E, inset). The switches were also successful in
regulating four different output proteins (see section S10.1 of
the Extended Experimental Procedures and Figure S1H).

Evaluation  of  Toehold  Switch  Orthogonality 

To measure the orthogonality of the devices, we performed additional
in silico screening to isolate a subset of 26 that displayed
extremely low levels of predicted crosstalk (see section S6.4 of
the Extended Experimental Procedures) and assayed in E. coli
all 676 pairwise combinations of switch and trigger plasmids.
Figure 2A displays images of GFP fluorescence from colonies
of E. coli induced on LB plates (see section S5 of the Extended
Experimental Procedures). Clearly visible is the strong emission
from cognate switch and trigger pairs along the diagonal of the
grid, with the final switch at index 26 displaying lower fluorescence
as a result of its low ON/OFF ratio. Low fluorescence
levels are observed for the off-diagonal elements featuring
noncognate trigger/switch RNA pairs.


We used flow cytometry to quantify GFP output from all
trigger-switch interactions (see section S4 of the Extended
Experimental Procedures). Crosstalk was calculated by dividing
the GFP fluorescence obtained from a noncognate trigger and a
given switch RNA by the fluorescence of the switch in its triggered
state (Figure 2B). The full set of 26 switches tested displayed
less than 12% crosstalk. Because the number of switches
in an orthogonal set is defined by its threshold crosstalk level, we
identified orthogonal subsets for a range of different crosstalk
thresholds (see section S11 of the Extended Experimental Procedures
and Table S2). For instance, a subset of 18 of the toehold
switches exhibited less than 2% subset-wide crosstalk.

A relevant metric for assessing orthogonal library performance
is the reciprocal of the threshold crosstalk level. For example, the
set of 18 switches with less than 2% crosstalk has a reciprocal
crosstalk threshold of 50. This metric corresponds to the minimum
output dynamic range to expect in a network employing
this library of 18 switches. Figure 2C plots this library dynamic
range metric against the maximum orthogonal subset size for
the toehold switches, as well as a number of previous
riboregulator systems.
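
The crosstalk definition and the reciprocal-threshold metric translate directly into code. The sketch below is a minimal illustration with invented numbers: `gfp[s][t]` is assumed to hold the GFP output of switch `s` with trigger `t`, with cognate pairs on the diagonal. The greedy pruning routine is a hypothetical heuristic for finding a subset under a given threshold and is not the procedure of section S11.

```python
def crosstalk_matrix(gfp):
    """Normalize each row by the cognate (diagonal) output of that switch."""
    n = len(gfp)
    return [[gfp[s][t] / gfp[s][s] for t in range(n)] for s in range(n)]

def max_crosstalk(xt, subset):
    """Largest noncognate crosstalk among the switches in subset."""
    return max((xt[s][t] for s in subset for t in subset if s != t), default=0.0)

def library_dynamic_range(xt, subset):
    """Reciprocal of the worst-case crosstalk within the subset."""
    worst = max_crosstalk(xt, subset)
    return float("inf") if worst == 0 else 1.0 / worst

def greedy_orthogonal_subset(xt, threshold):
    """Hypothetical heuristic: drop members of the worst pair until below threshold."""
    subset = list(range(len(xt)))
    while len(subset) > 1:
        worst, s, t = max((xt[s][t], s, t) for s in subset for t in subset if s != t)
        if worst < threshold:
            break
        def load(i):
            # Total crosstalk involving design i, as switch or as trigger.
            return sum(xt[i][j] + xt[j][i] for j in subset if j != i)
        subset.remove(s if load(s) >= load(t) else t)
    return subset

# Invented 3 x 3 example (arbitrary fluorescence units).
xt = crosstalk_matrix([[100, 1, 10], [2, 100, 1], [10, 1, 100]])
print(library_dynamic_range(xt, [0, 1, 2]))   # limited by the 10% pair
print(greedy_orthogonal_subset(xt, 0.05))
```

With a worst-case crosstalk of 2%, `library_dynamic_range` returns 50, matching the example of the 18-switch subset in the text.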

Forward  Engineering  of  New  High-Performance  Devices 

In-silico-designed  riboregulators  forward  engineered  to  exhibit 
high  performance  in  vivo  have  the  potential  to  significantly 
Figure 3. Forward Engineering and Thermodynamic Analysis of Toehold Switches

(A) Schematic of the design modifications made
for the forward-engineered switches compared to
the first-generation switches.

(B) ON/OFF GFP fluorescence ratios obtained for
the set of 13 forward-engineered toehold switches
after 3 hr of induction. Dashed black line marks the
mean ON/OFF fluorescence measured for the 168
first-generation toehold switches. Inset: time
course measurements for forward-engineered
switches number 6 and number 9. Relative errors
for the switch ON/OFF ratios were obtained by
adding the relative errors of the switch ON and
OFF state fluorescence measurements in quadrature.
Relative errors for ON and OFF states are
from the SD of at least three biological replicates.

(C) Percentage of first-generation and forward-engineered
library components that had ON/OFF ratios
that exceeded the value defined on the x axis.

(D) Correlation between ΔGRBS-linker and ON/OFF
ratio measured for the 68 first-generation toehold
switches that had an A-U base pair at the top of the
hairpin stem. Inset: Schematic showing the RNA
sequence range in the trigger-switch complex
used to define ΔGRBS-linker. Relative errors for the
switch ON/OFF ratios were obtained by adding the
relative errors of the switch ON and OFF state
fluorescence measurements in quadrature. Relative
errors for ON and OFF states are from the SD
of at least three biological replicates.

(E) Strong correlation (R² = 0.79) between ΔGRBS-linker and ON/OFF ratio measured for the complete set of forward-engineered switches. Relative errors for the
switch ON/OFF ratios were obtained by adding the relative errors of the switch ON and OFF fluorescence measurements in quadrature. Relative errors for ON and
OFF states are from the SD of at least three biological replicates.

See  also  Figure  S2  and  Table  S3. 
reduce the time required for generating new genetic circuits. The
first-generation toehold switches fell short of this goal because
they displayed large variations in output characteristics
(Figure 1E). To determine the design features that yielded
high-performance switches, we performed detailed analyses of the
first-generation library to uncover sequence-dependent factors
that affected switch performance (see sections S9 and S12 of
the Extended Experimental Procedures; Figures S1G and S2A;
Table S1). We integrated these findings into the design of
second-generation toehold switches forward engineered for wide
dynamic range.

The forward-engineered switches incorporate four design
changes as shown in Figure 3A (see Figure S1A for direct
comparisons). Modifications were made to the sequences at the
top of the switch RNA stem, the loop of the switch RNA was
increased, and the binding site of the trigger RNA was shifted
away from the RBS. These three design changes increased the
strength of the switch RNA RBS to increase its ON state protein
production. The length of the toehold domain was also increased
to promote binding between switch and trigger RNAs.

We designed and evaluated a set of 13 of these second-generation
toehold switches (see Table S3 for RNA sequences).
Figure 3B presents the ON/OFF fluorescence ratios for the
forward-engineered systems regulating GFP after 3 hr of induction
(see Figure S2B for flow cytometry data). There is a dramatic
increase in ON/OFF fluorescence for almost all the systems
tested, with 12 out of 13 switches exhibiting a dynamic range
comparable to or higher than the highest performance toehold
switch from the initial library (ON/OFF ratio 290 ± 20). These
systems exhibit an average ON/OFF ratio of 400 compared to
43 for the first-generation design. This mean ON/OFF ratio rivals
the dynamic range of protein-based regulators, and it does so
using a highly programmable system design without requiring
any evolution or large-scale screening experiments. Furthermore,
even the lowest performance second-generation switch
displayed an ON/OFF ratio of 33 ± 4, which is still sufficient
for many logic operations. Hourly time course measurements
reveal activation of forward-engineered switches after 1 or
2 hr of induction (Figure 3B, inset). Furthermore, ON state
fluorescence increased steadily over 4 hr, yielding ON/OFF levels
well over 600 for the switches. Additional testing confirmed
successful forward-engineered switch operation in different strains
of E. coli and with the endogenous E. coli RNA polymerase (see
section S10 of the Extended Experimental Procedures,
Figure S2C, and Table S3).

We quantified the effectiveness of our rational engineering
strategy by calculating the percentage of forward-engineered
designs with ON/OFF ratios exceeding a given minimal level
and comparing them to the same evaluation performed on the
library of 168 first-generation toehold switches (Figure 3C).
The yield of high-performance switches is higher for the
forward-engineered devices for all ON/OFF ratios tested.
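
The yield comparison amounts to evaluating, for each library, the fraction of designs whose ON/OFF ratio exceeds a cutoff. A minimal sketch follows; the function names are hypothetical and the ratio lists are invented placeholders, not the measured values behind Figure 3C.

```python
def percent_exceeding(ratios, threshold):
    """Percentage of designs whose ON/OFF ratio is above the threshold."""
    return 100.0 * sum(r > threshold for r in ratios) / len(ratios)

def yield_curve(ratios, thresholds):
    """Yield at each threshold, i.e., one curve of a Figure 3C-style plot."""
    return [(t, percent_exceeding(ratios, t)) for t in thresholds]

# Placeholder data for two hypothetical libraries.
first_generation = [5, 20, 150, 400]
forward_engineered = [33, 290, 450, 600]
for t, pct in yield_curve(forward_engineered, [10, 100, 500]):
    print(t, pct, percent_exceeding(first_generation, t))
```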
Thermodynamic Parameter Correlated with Switch
Performance

We conducted extensive analyses to identify thermodynamic parameters
that were linked to the ON/OFF performance of the
toehold switches. Candidate thermodynamic factors were rapidly
screened by computing linear regressions against subsets of the
first-generation toehold switches satisfying particular sequence
constraints (see section S13 of the Extended Experimental Procedures,
Figure S2D, and Table S3). The screening process identified
a single parameter ΔGRBS-linker that was correlated with the
ON/OFF ratios for the subset of 68 first-generation switches
with an A-U base pair at the top of their stem. A linear fit of
ΔGRBS-linker to the logarithm of their ON/OFF ratios yielded a
coefficient of determination R² = 0.40 (Figure 3D). This ΔGRBS-linker
term is defined as the free energy of the sequence running from
the RBS region to the end of the linker (Figure 3D, inset). It reflects
the amount of energy required by the ribosome to unwind the
RBS and early-mRNA region as it binds and begins translation
of the output gene. A fit of ΔGRBS-linker to the ON/OFF ratio for
the forward-engineered switches yielded a much stronger correlation
with R² = 0.79 (Figure 3E). Importantly, we found that this
single thermodynamic factor could explain the lone
low-performance, forward-engineered toehold switch.
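
The regression screen can be sketched as an ordinary least-squares fit of the logarithm of the ON/OFF ratio against a candidate free energy, reporting the coefficient of determination. The helper name `linear_fit_r2` is hypothetical, and the data points are invented so that the arithmetic is easy to follow; they imply nothing about the sign or size of the measured correlation.

```python
import math

def linear_fit_r2(x, y):
    """Least-squares line y = slope * x + intercept and its R^2."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = my - slope * mx
    ss_res = sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - my) ** 2 for yi in y)
    return slope, intercept, 1.0 - ss_res / ss_tot

# Invented free energies (kcal/mol) and ON/OFF ratios for four designs.
delta_g = [-10.0, -8.0, -6.0, -4.0]
on_off = [10, 100, 1000, 10000]
log_ratio = [math.log10(r) for r in on_off]
print(linear_fit_r2(delta_g, log_ratio))
```

In a screen like the one described, this fit would be repeated for each candidate thermodynamic parameter and each sequence-constrained subset, keeping the parameter with the highest R².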

Toehold  Switches  Triggered  by  mRNAs  and 
Endogenous  RNAs 

The capacity of toehold switches to accept trigger RNAs with
arbitrary sequences enables them, in principle, to be activated
by functional mRNAs (Figure 4A). However, the fixed sequences
of potential mRNA triggers present significant challenges for
effective system activation. Unlike synthetic trigger RNAs designed
de novo to be single stranded, strong secondary structures
abound within mRNAs, complicating toehold binding and
decreasing the thermodynamics driving the branch migration
process. The cognate toehold sequences defined by the trigger
mRNA can also exhibit base pairing both internally and with sequences
downstream of the hairpin module and thus pose
similar challenges to activation.

To counter these effects, we increased the toehold domain
length of the mRNA-sensing switches from 12 or 15 nts to >24
nts. This modification shifted the importance of single-stranded
regions for binding initiation from the trigger mRNA to the toehold
switch itself. We also incorporated a common sequence element
from a high-performance first-generation switch into the sensor
designs (see section S14.1 of the Extended Experimental Procedures,
Figures S3A-S3C, and Table S4) and adopted a programmed
RNA refolding mechanism to decrease the energetic
barrier to switch activation (Figures 4A and S3D; see section
S14.2 of the Extended Experimental Procedures). Candidate
sensors for a given mRNA or endogenous RNA were designed
and screened in silico to identify those with the greatest probability
of success (see section S14.3 of the Extended Experimental
Procedures, and Table S4 for RNA sequences).

We first validated sensors for detecting the mCherry mRNA.
These sensors express GFP only upon binding to mCherry transcripts,
enabling simultaneous monitoring of sensor output via
GFP and transcription of the trigger species through mCherry
fluorescence. Figure 4B displays the GFP and mCherry fluorescence
levels measured for three mCherry-responsive switches in
the absence or presence of their cognate trigger mRNA. Fluorescence
levels from control measurements are also shown for
comparison. Negative controls reflect E. coli autofluorescence
measured in either fluorescence channel, whereas positive controls
were obtained from cells expressing unrepressed GFP and
mCherry from medium and high copy vectors, respectively.
These results show that the switches strongly activate expression
of GFP only upon transcription of the mCherry mRNA and
provide low GFP expression in the absence of the mCherry
trigger. ON/OFF GFP fluorescence values obtained from the
three mCherry mRNA sensors are shown in Figure 4C. In addition,
we validated two sensors for detecting antibiotic resistance
conferring mRNAs: chloramphenicol acetyltransferase (cat) and
aadA (spectinomycin resistance) (Figure 4C).

We also designed a toehold switch sensor to detect the
endogenous E. coli small RNA (sRNA) RyhB. RyhB is a 90 nt transcript
that downregulates iron-associated genes in conditions in
which iron levels are low (Masse and Gottesman, 2002). To characterize
the sensors, cells were transformed with plasmids
constitutively expressing the RyhB-responsive toehold switch
regulating GFP (Figure 4D; see section S15 of the Extended
Experimental Procedures and Table S4). The RyhB sRNA was
induced by adding the iron-chelating compound 2,2'-bipyridyl
to the growth media (Figure 4D), which is known to rapidly stimulate
expression of RyhB (Masse and Gottesman, 2002). We
measured sensor output 1 hr after induction by the chelator using
flow cytometry. The sensor transfer function shows a steady
increase in GFP expression as 2,2'-bipyridyl levels increase to
0.3 mM, beyond which levels plateau (Figure 4E). In contrast, a
GFP-positive control construct demonstrated decreasing output
as 2,2'-bipyridyl levels increased.

Synthetic  Regulation  of  Endogenous  Genes 
Toehold switches can be integrated into the genome to regulate
translation of endogenous genes. Template genome-editing
plasmids were constructed that contained a high-performance
second-generation switch adjacent to a kanamycin resistance
marker flanked by a pair of FRT sites (Figure 5A; see section
S16 of the Extended Experimental Procedures, Table S5, and
Figure S4). Linear DNA fragments were amplified from these
plasmids and were inserted upstream of targeted chromosomal
genes using λ Red recombination (Datsenko and Wanner, 2000).
The resulting E. coli strain retains a functional copy of the targeted
gene in its chromosome; however, it is deactivated as a
result of the cotranscribed switch RNA module. This repressed
gene can be activated posttranscriptionally by expression of a
cognate trigger.

We inserted switches upstream of the genes uidA, lacZ, and
cheY. The genes uidA and lacZ produce the enzymes β-glucuronidase
and β-galactosidase, respectively. Cells expressing
these enzymes can be readily identified by their blue/green color
on plates containing the corresponding substrates X-Gluc and
X-Gal. We constructed two strains with synthetic uidA regulation
by integrating switches A and B into the chromosome (uidA::Switch A
and uidA::Switch B, respectively). Figure 5B displays
these strains upon expression of different trigger RNAs, as
well as a control strain with the wild-type uidA genotype. As
Figure 4. Toehold Switch Activated by mRNA and Endogenous Small RNA Triggers

(A) Design schematic and putative activation pathway of the toehold switch mRNA sensors. Switch common sequence element is outlined in pink.

(B) Mode GFP and mCherry fluorescence obtained from flow cytometry of three mCherry sensors in their repressed and activated states, as well as positive and
negative controls. Error bars are the SD from at least three biological replicates.

(C) ON/OFF GFP fluorescence ratios for a series of toehold switches activated by the mCherry mRNA, and cat and aadA mRNAs, which confer chloramphenicol
and spectinomycin resistance, respectively. Relative errors for the mRNA sensor ON/OFF ratios were obtained by adding the relative errors of the sensor ON and
OFF state fluorescence measurements in quadrature. Relative errors for ON and OFF states are from the SD of at least three biological replicates.

(D) Endogenous and synthetic gene networks used for sensing the RyhB sRNA.

(E) Transfer function for the RyhB sensor (purple curve) as a function of RyhB inducer concentration. Output of a constitutive GFP expression cassette is shown for
comparison (green curve). Error bars are the SD from at least three biological replicates.

See also Figure S3 and Table S4.


expected, uidA::Switch A only exhibits the blue/green wild-type
phenotype upon expression of trigger A. Similarly, uidA::Switch
B activates β-glucuronidase only with cognate trigger B.

The edited strain lacZ::Switch C provides more complicated
behavior because the lac operon is regulated at the transcriptional
level by lactose or chemical analogs such as IPTG. Thus,
lacZ::Switch C requires both lactose/IPTG and trigger RNA C
to turn on expression of β-galactosidase. This behavior results
in a genetic AND circuit combining transcriptional and
posttranscriptional regulation. We tested this AND circuit by expressing
Figure 5. Synthetic Regulation of Endogenous Genes

(A) Integration of switch modules into the genome upstream of the targeted gene
(gene B) at sites H1 and H2 using λ Red recombination. Switch-edited gene B
is translationally repressed but can be activated via the cognate trigger RNA.

(B) Images of uidA::Switch A and uidA::Switch B spread onto X-Gluc plates
with different trigger RNAs. uidA expression like the wild-type (top left) is only
observed with cognate trigger RNAs as seen by blue/green color change.

(C) Images of lacZ::Switch C with different combinations of IPTG and aTc
chemical inducers. lacZ::Switch C only activates with aTc-induced expression
of trigger C in conditions in which lacZ transcription is induced by IPTG.
Wild-type lacZ (top left) is activated whenever IPTG is present.

(D) Motility assays for cheY::Switch D on soft agar plates. cheY::Switch D is
only able to move away from the point of inoculation at the plate center when
trigger D is induced with IPTG.

See also Figure S4 and Table S5.


different trigger RNAs using inducible promoters responsive to
anhydrotetracycline (aTc). Figure 5C provides images of
lacZ::Switch C transformed with different trigger plasmids for all four
combinations of the two chemical inputs (i.e., IPTG and aTc) of
the AND circuit. In the absence of IPTG, none of the strains
show any change in color because expression of the lac operon
is strongly repressed. For a plate containing IPTG and no aTc,
the wild-type lacZ strain (upper left quadrant) becomes
blue/green in color, whereas the lacZ::Switch C strains with different
triggers do not change in color because the trigger RNAs are
not being expressed. When the AND condition is satisfied by
adding both IPTG and aTc to the plate, lacZ::Switch C exhibits
the blue/green color change with trigger RNA C as expected,
whereas those expressing triggers A and B are unchanged.
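
The expected behavior of the edited strain can be summarized as a two-input truth table. The sketch below is only a logical abstraction of the plate assay described above; the function names are hypothetical, and real expression levels are graded rather than Boolean.

```python
from itertools import product

def lacz_switch_c_on(iptg, atc, trigger):
    """Expected blue/green phenotype of lacZ::Switch C (idealized logic)."""
    transcribed = iptg                          # lac operon transcription requires IPTG
    trigger_present = atc and trigger == "C"    # aTc induces the trigger; only C is cognate
    return transcribed and trigger_present

def truth_table(trigger="C"):
    """Map each (IPTG, aTc) input pair to the expected output."""
    return {(iptg, atc): lacz_switch_c_on(iptg, atc, trigger)
            for iptg, atc in product([False, True], repeat=2)}

for inputs, output in truth_table("C").items():
    print(inputs, output)
```

Only the input pair with both IPTG and aTc present yields an output for trigger C, and no input pair does so for the noncognate triggers A or B.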

Lastly, we conditionally regulated the E. coli chemotaxis gene
cheY. cheY::Switch D was transformed with plasmids that expressed
triggers inducibly via IPTG. Only cells expressing trigger
RNA D with IPTG demonstrated significant motility, whereas
those expressing noncognate trigger RNA A or lacking the
IPTG inducer were unable to move from the point of inoculation
(Figure 5D).

Multiplexed  Regulation 

To demonstrate the full multiplexing capabilities of toehold
switches, we expressed 12 toehold switches in the same cell
and independently confirmed their activity via flow cytometry.
We used four different fluorescent proteins (GFP, venus, cerulean,
and mCherry) as reporters and constructed three plasmids
to express each of the reporter proteins (Figure 6A; see section
S17 of the Extended Experimental Procedures; Table S6). Each
of these proteins was regulated using a different switch from
the second-generation library. The resulting constructs were expressed
from a single T7 promoter as ~3.4 kb polycistronic
mRNAs resulting in a total synthetic network size of over 10 kb.

Figure 6B presents the outcome of the multiplexing experiments
with the output of each reporter represented in terms
of the percentage of cells expressing the reporter. A cell was
determined to be expressing a given reporter if its fluorescence
exceeded a threshold level held constant for all the plots in
Figure 6B (see section S17.3 of the Extended Experimental Procedures
and Figure S5). The first two rows of Figure 6B display the
output from all 12 of the switches activated separately from a single
expressed trigger RNA. In all 12 cases, significant expression
is only observed from the intended reporter with limited crosstalk
in the three other channels.
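
The thresholding step can be sketched as follows. This is an assumption-laden illustration rather than the analysis of section S17.3: each cell is represented as a dictionary of per-reporter fluorescence values, and the threshold values and cell data are invented.

```python
def percent_expressing(cells, thresholds):
    """Percentage of cells whose fluorescence exceeds the threshold, per reporter."""
    counts = {name: 0 for name in thresholds}
    for cell in cells:
        for name, cutoff in thresholds.items():
            if cell[name] > cutoff:
                counts[name] += 1
    return {name: 100.0 * c / len(cells) for name, c in counts.items()}

# Invented events and thresholds (arbitrary units); the same thresholds
# would be reused for every trigger combination being compared.
thresholds = {"GFP": 100, "venus": 100, "cerulean": 100, "mCherry": 100}
cells = [
    {"GFP": 900, "venus": 20, "cerulean": 15, "mCherry": 10},
    {"GFP": 750, "venus": 30, "cerulean": 12, "mCherry": 8},
    {"GFP": 40, "venus": 25, "cerulean": 110, "mCherry": 9},
    {"GFP": 820, "venus": 18, "cerulean": 14, "mCherry": 11},
]
print(percent_expressing(cells, thresholds))
```

Holding the thresholds constant across conditions is what makes the percentages comparable between single-trigger and multi-trigger experiments.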

We also tested activation of all two- and three-color combinations
of reporter proteins and observed all the expected output
color combinations (Figure 6B). We detected some unintended
leakage of cerulean and GFP for trigger combinations activating
GFP/venus and venus/cerulean, respectively. Because single
trigger measurements of each of these triggers displayed low
leakage, we attribute much of the observed leakage in these
two-trigger experiments to imperfect compensation of the flow
cytometry data caused by the spectral overlap between GFP,
venus, and cerulean (see Figure S5A). We also successfully activated
all four output proteins by simultaneously expressing four
trigger RNAs and observed low system leakage from two noncognate
trigger RNAs (Figure 6B, two bottom-right panels).
Figure 6. Simultaneous Regulation of Gene Expression by 12 Toehold Switches

(A) Schematics of plasmids and ~3.4 kb polycistronic mRNAs used for multiplexing studies. Each reporter has its own switch RNA that can be independently
activated by its cognate trigger RNA.

(B) Percentage of cells expressing each of the four reporters for a set of 24 different trigger RNA combinations. Gray and colored circles are used to identify the
particular trigger RNA being expressed and the corresponding switch RNA. Error bars are the SD from at least three biological replicates.

See  also  Figure  S5  and  Table  S6. 


10 Cell 159, 1-15, November 6, 2014 ©2014 Elsevier Inc.


Please cite this article in press as: Green et al., Toehold Switches: De-Novo-Designed Regulators of Gene Expression, Cell (2014),
http://dx.doi.org/10.1016/j.cell.2014.10.002




Figure 7. A Layered 4-Input AND Circuit

(A) Design schematic for the 4-input AND circuit consisting of three 2-input AND gates formed by three toehold switches, two orthogonal transcription factors
(ECF41_491 and ECF42_4454), and a GFP reporter.

(B) Complete 16-element truth table for the 4-input AND system. GFP expression from the sole logical TRUE output case with all input RNAs expressed (far right)
is significantly higher than the logical FALSE output cases where one or more input RNAs is absent. Relative errors for the circuit ON/OFF ratios were obtained by
adding the relative errors of the circuit fluorescence measurements in quadrature. The fluorescence level measured from cells not expressing any of the input
RNAs was used as the OFF state fluorescence level in the ON/OFF calculations. Relative errors for different input states are from the SD of at least three
biological replicates.

See also Figure S6 and Table S7.


Implementation  of  a  4-Input  AND  Circuit 

Toehold switches are readily integrated with existing biological
components to build sophisticated genetic programs. We constructed a
layered 4-input AND gate consisting of three toehold switches coupled to
two orthogonal transcription factors and a GFP reporter (Figure 7A). In
this circuit, toehold switch RNA pairs act as two independent input
species that must both be present before a 2-input logical AND expression
evaluates as TRUE. The first computational layer in the circuit consists
of two 2-input AND toehold switch gates, which each produce a
transcription factor. A pair of extracytoplasmic function (ECF) sigma
factors were used as the transcription factors in the circuit because
they had previously been reported to be highly orthogonal (Rhodius et
al., 2013). The sigma factors produced from the first layer then activate
transcription of the toehold switch RNAs in the second computational
layer, which in turn lead to expression of a GFP reporter. Similar
layered circuits have previously been constructed using transcription
factors that required a second chaperone protein for full activity
(Moon et al., 2012).

To validate the circuit, we constructed plasmids to express all 16
combinations of the four input trigger and switch RNAs (see section S8
of the Extended Experimental Procedures, Table S7, and Figure S6). The
full truth table for the 4-input AND computation is shown in Figure 7B
with ON/OFF levels calculated by normalizing GFP fluorescence to that
measured for the case in which none of the input bits are expressed. GFP
output when all four input bits are present is significantly higher than
all other input permutations, as expected for a functional 4-input AND
computation.
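
As an illustration of the layered logic and the ON/OFF normalization described above, the following sketch enumerates the 16 input states and normalizes a set of GFP readings to the state in which no input is expressed. The function names and the fluorescence values are hypothetical placeholders, not measurements from the study.

```python
from itertools import product


def layered_and(a: bool, b: bool, c: bool, d: bool) -> bool:
    """Model the two-layer circuit as Boolean logic."""
    factor_1 = a and b            # first-layer gate 1 yields one transcription factor
    factor_2 = c and d            # first-layer gate 2 yields the other
    return factor_1 and factor_2  # second layer: both factors are needed for GFP


def truth_table() -> dict:
    """Map each of the 16 input states to the expected logical output."""
    return {bits: layered_and(*bits) for bits in product((False, True), repeat=4)}


def on_off_ratios(gfp_by_state: dict) -> dict:
    """Normalize GFP levels to the state in which no input RNA is expressed."""
    off_level = gfp_by_state[(False, False, False, False)]
    return {state: level / off_level for state, level in gfp_by_state.items()}


table = truth_table()
# Hypothetical GFP levels (arbitrary units): high only for the logical TRUE state.
gfp = {state: (500.0 if is_on else 10.0) for state, is_on in table.items()}
ratios = on_off_ratios(gfp)
for state, is_on in table.items():
    print(state, is_on, ratios[state])
```

Only the state with all four inputs present evaluates as TRUE, which is the pattern a functional 4-input AND circuit should reproduce in the measured ratios.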

DISCUSSION 

Origins  of  Increased  Toehold  Switch  Performance 
Compared  to  Previous  Riboregulators 

Toehold switches represent a versatile and powerful new platform for
regulating translation at the posttranscriptional level. They combine a
high degree of component orthogonality with system dynamic range
comparable to protein-based transcriptional regulators (Lutz and Bujard,
1997; Rhodius et al., 2013). Of note, the dynamic range of the toehold
switches is significantly greater than that observed for previous
engineered riboregulators. Earlier efforts to design riboregulators with
higher performance have been hindered by three general misconceptions.
(1) Because natural riboregulators have served as the starting point for
new riboregulators, it has been implicitly assumed that engineered
riboregulators cannot be substantially modified from their parent natural
systems without abolishing their functionality. This approach employing
small sequence perturbations parallels efforts in engineering new
protein-based parts. However, it neglects the considerable advantage in
in silico simulation and design available for engineering RNAs compared
to proteins. This underlying assumption has in turn led to (2) an
emphasis on loop-loop and loop-linear RNA interaction mechanisms and
(3) the use of RNA binding to the RBS region to repress translation.

Although application of any of these criteria for engineered
riboregulators does not necessarily preclude them from functioning well
as biological parts, toehold switches demonstrate that each of these
design rules is not required for high-performance devices. First, toehold
switches do not have any direct natural riboregulator counterparts, and
thus they have no natural analogs to narrow the sequence space of
potential devices. Relieving this constraint enabled us to generate large
libraries of potential devices and explore a more diverse array of
sequences compared to previous riboregulators. Furthermore, this new
approach allowed RNA sequences to be designed de novo with ideal
secondary structures for optimal device performance. Thus, trigger RNA
transcripts feature single-stranded domains for binding to the switch
RNA, and the ensuing trigger-switch RNA complex is engineered to have low
secondary structure near the start codon to encourage efficient
translation by the ribosome to maximize the device ON state.

Second, toehold switch RNA interactions are mediated through
linear-linear interactions rather than loop-mediated ones. These
reactions can provide faster kinetics than loop-based reactions, and the
long ≥12 nt toehold of the switch RNA provides stronger thermodynamics
than the shorter loop domains of earlier riboregulators. These features
cause a higher percentage of the total switch RNAs present in the cell to
be triggered, thus increasing production of the output protein. We found
that the fraction of activated switch RNAs was ~100% based on comparison
measurements with unrepressed versions of the switch RNA (Figures 2A and
2B). In contrast to earlier reports, thermodynamic analyses of toehold
switch performance did not reveal significant correlations between
riboregulator ON/OFF levels and the free energy of the switch-trigger
interaction nor the free energy of toehold-trigger binding (Mutalik et
al., 2012). These observations suggest that RNA-RNA interactions for the
toehold switches are so strongly thermodynamically favored that other
factors are presently limiting device performance in cells.

Third, toehold switches do not rely on direct binding to the RBS to
repress translation. Instead, the RBS is enclosed within a loop, and
repression is achieved by secondary structures immediately before and
after the start codon. This design feature increased the space of
potential trigger RNA sequences, and it provided a straightforward means
of enhancing ON state translation levels simply by adding A-rich
sequences to the loop upstream of the RBS. Importantly, these
modifications did not require corresponding changes to the trigger RNA,
which enabled facile interpretation of the effects of modifications to
this pre-RBS region. Taken together, all these design features enabled
the toehold switch to detect arbitrary trigger RNAs and translate any
output protein.

Our results suggest that existing riboregulators can be improved
substantially by adopting some of the features of our toehold switches.
The length of loop domains in loop-mediated interactions can be increased
to improve reaction thermodynamics. Furthermore, the unpaired region
upstream of the RBS in translational riboregulators can be lengthened to
enhance protein output and, in turn, device dynamic range.

Constraints  on  Toehold  Switch  Trigger  RNAs  and 
Output  Proteins 

The designs we have chosen for the toehold switches place some
restrictions on the sequences of trigger RNAs. Inclusion of the 3 nt
bulge at the start codon AUG position of the switch RNA precludes the
trigger RNA from having an AUG sequence at positions programmed to
hybridize with this bulge. The switch RNA also possesses a 9 bp stem
after the start codon, which must not code for an in-frame stop codon.
This restriction in turn imposes sequence conditions on the trigger,
which must unwind this region of the switch RNA stem. The toehold
switches also add 11 residues from the switch stem to the linker to the
N terminus of the regulated protein, which could be problematic for an
output protein sensitive to changes in its N terminus.
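
The stop codon restriction above lends itself to a simple automated check. The sketch below scans an RNA sequence in frame from a given start position and reports whether a stop codon is encountered. The function names and example sequences are hypothetical, and the position of the start codon within a real switch RNA is assumed to be supplied by the designer.

```python
STOP_CODONS = {"UAA", "UAG", "UGA"}


def in_frame_codons(rna: str, start: int):
    """Yield successive complete codons read in frame from index `start`."""
    for i in range(start, len(rna) - 2, 3):
        yield rna[i:i + 3]


def has_in_frame_stop(rna: str, start: int) -> bool:
    """Return True if any in-frame codon after `start` is a stop codon."""
    return any(codon in STOP_CODONS for codon in in_frame_codons(rna, start))


# Hypothetical sequences: the first contains UAA in frame, the second only out of frame.
print(has_in_frame_stop("AUGGCUUAAGCA", start=0))
print(has_in_frame_stop("AUGGCUUUAAGC", start=0))
```

A candidate stem sequence that fails this check would be rejected before the corresponding trigger RNA is designed.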

In practice, the conditions imposed on the trigger RNA and output protein
can be avoided with a few modifications to the toehold switch design.
Constraints on the trigger RNA are a byproduct of the base-pairing
conditions specified for the switch RNA stem and the trigger-switch
complex. These particular secondary structures are not strictly required
for switch operation. For instance, forward-engineered switch number 5
has only a 1 nt bulge in its stem but provides ON/OFF > 400. We expect
that design modifications that add and subtract base pairs from the
switch RNA will allow the toehold switches to modulate gene expression
while providing sufficient design flexibility to eliminate stop and start
codon constraints on the trigger sequence. The toehold switches can also
be modified to incorporate the coding sequence of the output protein
directly into the switch RNA stem, thus making them compatible with any
protein sensitive to N-terminal modifications.

Engineered  Regulators  with  Very  Low  System  Crosstalk 

The capacity of toehold switches to respond to diverse trigger RNAs
enabled us to use in silico techniques to design large libraries of
orthogonal components. Comprehensive evaluations of in vivo
trigger-switch RNA pairwise interactions revealed a set of 26 toehold
switches with 12% crosstalk levels. The largest previously reported
orthogonal riboregulator set consisted of seven transcriptional
attenuators displaying 20% crosstalk (Takahashi and Lucks, 2013). In this
case, crosstalk level of 20% meant that the set of 42 off-target RNA
sense-antisense interactions attenuated transcription by at most 20%.
Such crosstalk results in an upper bound in overall library dynamic range
of 5 (Figure 3C). Earlier orthogonal translational activators and
repressors have been limited to sets of four (Callura et al., 2012) and
six (Mutalik et al., 2012), respectively, at 20% crosstalk. For proteins,
an engineered library of five orthogonal eukaryotic transcription factors
with crosstalk of ~30% was reported (Khalil et al., 2012). Recently, part
mining was used to engineer a set of four orthogonal chimeric sigma
factors with ~10% crosstalk and a subset of three with ~2% crosstalk
(Rhodius et al., 2013). Thus, to our knowledge, these toehold switches
constitute the largest set of engineered orthogonal regulatory elements,
RNA- or protein-based, reported to date.

Crosstalk experiments also revealed smaller sets of orthogonal switches
exhibiting substantial improvements in overall dynamic range, including a
set of 18 with 50-fold predicted minimum dynamic range (<2% crosstalk)
and a set of 5 with >200-fold dynamic range (<0.5% crosstalk). Subsets of
orthogonal toehold switches of comparable size to previously reported
riboregulator libraries therefore exhibit minimum dynamic range over an
order of magnitude larger than the earlier RNA-based systems. At this
point, the ultimate size of the orthogonal libraries of toehold switches
is limited by throughput of our crosstalk assay, not design features
intrinsic to the devices themselves.
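
The figures quoted above are consistent with treating the library dynamic range as bounded by the reciprocal of the worst-case crosstalk (20% crosstalk gives a bound of 5, 2% gives 50, and 0.5% gives 200). The following sketch makes that arithmetic explicit; the function name is hypothetical, and the reciprocal relationship is an interpretation inferred from the quoted numbers rather than a formula stated in the text.

```python
def dynamic_range_bound(crosstalk_percent: float) -> float:
    """Upper bound on library dynamic range, taken as 100 / crosstalk (%)."""
    if not 0.0 < crosstalk_percent <= 100.0:
        raise ValueError("crosstalk must be in the interval (0, 100] percent")
    return 100.0 / crosstalk_percent


# Crosstalk levels quoted in the text, in percent.
for crosstalk in (20.0, 12.0, 2.0, 0.5):
    print(crosstalk, dynamic_range_bound(crosstalk))
```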

RNA-Based,  Forward-Engineered  Components 
for  Synthetic  Biology 

We anticipate that our large library of orthogonal toehold switches will
enable new applications and capabilities for synthetic biology. Their
combination of wide dynamic range, low leakage levels, orthogonality, and
rapid response is ideal for implementing time-sensitive logic circuitry
in living systems. Furthermore, the protein-like dynamic range of the
toehold switches is achieved at a lower metabolic cost than protein-based
systems because all regulatory components are composed of RNA, and
translation only occurs if the switches are activated. As synthetic
genetic networks grow in complexity and impose greater burdens on the
host, the lower metabolic footprint of the toehold switches will become
an increasingly important advantage in their favor. Moreover, their low
leakage levels suggest they can be used as constitutively expressed
passive monitoring or sensor systems that trigger a downstream response
only in desired situations.

Our success in using in silico design tools to reliably produce
high-performance, orthogonal toehold switches represents an important
step forward for synthetic biology. The process of constructing new
synthetic gene networks generally requires multiple design-build-test
cycles before the desired network behavior is achieved. Consequently, new
devices like the toehold switch that routinely provide functional
in-silico-designed components will reduce the number of design cycles
required for optimization by improving the components tested with each
passing cycle. Improvements in device design can be used for decreasing
development time but will also facilitate construction of more complex
genetic networks. In particular, in silico design tools currently enable
simulation of multiple interacting RNAs, which could be used for carrying
out complex logic operations (Qian and Winfree, 2011) from programmed RNA
networks. In these higher-order computations, the programmability of
toehold switches provides them with a distinct advantage over
protein-based regulators that are currently less amenable to forward
engineering. We also expect that additional improvements to toehold
switch design performance can be obtained by incorporating the effects of
the ΔG_RBS-linker term into future in silico design algorithms.

Toehold  Switches  as  RNA-to-Protein  Signal 
Transducers 

The ability of toehold switches to activate translation in response to
arbitrary RNA sequences, including full-length mRNAs, enables them to act
as universal RNA-to-protein signal transducers. This new capability
provides a generalizable approach to interfacing endogenous RNA networks
with artificial genetic networks for synthetic biology and a
nondestructive means of monitoring RNAs in living cells for answering
fundamental questions in biology. Many future applications in synthetic
biology will require genetic circuits that can sense the current status
of the cell and operate on this information to modulate their activity.
For instance, this capability can be used in metabolic engineering to
construct self-optimizing networks that detect cellular stress levels or
pathway-associated RNAs to tune network activity up or down for
maximizing yields of useful chemical products. For biological studies,
toehold switch signal transducers can be used to simultaneously monitor
the levels of multiple untagged, endogenous RNA species by producing
different output fluorescent proteins.

Potential  Role  of  Similar  Systems  in  Nature 

The protein-like dynamic range of toehold switches challenges the widely
held view that RNA-based systems offer weaker regulation than their
protein counterparts (Liu et al., 2012). The performance of these
riboregulators also suggests that some of their operating mechanisms may
already play a role in natural systems. Although, to our knowledge,
analogous systems have yet to be found in nature, our results indicate
that a re-examination of bacterial genomes in a directed search for these
specific regulatory mechanisms may be in order. Conversely, the absence
of well-known natural systems that exploit the design principles of
toehold switches could also imply that these synthetic devices are not
easy to evolve. In particular, we hypothesize that the loop regions of
loop-mediated riboregulators are more insulated from interactions with
nearby bases than the unprotected single-stranded regions of toehold
switch trigger RNAs. Thus, single-stranded trigger RNAs must sample a
greater number of potential sequences to evolve to their intended
function than natural loop-mediated trigger RNAs.

EXPERIMENTAL  PROCEDURES 

The  Extended  Experimental  Procedures  are  available  online. 

Strains  and  Growth  Conditions 

The following E. coli strains were used in this study: BL21 Star DE3
(F− ompT hsdSB (rB− mB−) gal dcm rne131 [DE3]; Invitrogen); BL21 DE3
(F− ompT hsdSB (rB− mB−) gal dcm [DE3]; Invitrogen); MG1655Pro (F− λ−
ilvG− rfb-50 rph-1 SpR lacR tetR); and DH5α (endA1 recA1 gyrA96 thi-1
glnV44 relA1 hsdR17(rK− mK+) λ−). All strains were grown in LB medium
with appropriate antibiotics at 37°C (see also section S1 of the
Extended Experimental Procedures).

Plasmid Construction

All DNA oligonucleotides were purchased from Integrated DNA Technologies.
Plasmids were constructed and purified using standard molecular biology
techniques. Table S1 contains the primers and sequences used for
experiments (see also section S2 of the Extended Experimental Procedures).

Flow  Cytometry  Measurements  and  Analysis 

Flow cytometry was performed using a BD LSRFortessa cell analyzer
equipped with a high-throughput sampler. Cells were typically diluted by
a factor of ~65 into phosphate buffered saline (PBS) and sampled from
96-well plates. Forward scatter (FSC) was used for trigger, and ~20,000
individual cells were analyzed. Error levels for the fluorescence
measurements of ON state and OFF state cells were calculated from the SD
of measurements from at least three biological replicates. The relative
error levels for the ON/OFF fluorescence ratios were then determined by
adding the relative errors of ON and OFF state fluorescence in quadrature
(see also sections S3 and S4 of the Extended Experimental Procedures).
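
The error propagation described above can be sketched as follows. The relative error of the ON/OFF ratio is the square root of the sum of the squared relative errors of the ON and OFF measurements. The function name and replicate values are hypothetical, and the use of the sample standard deviation is an assumption, since the text specifies only "SD".

```python
import math
from statistics import mean, stdev


def on_off_with_error(on_reps, off_reps):
    """Return (ON/OFF ratio, absolute error) with relative errors added in quadrature."""
    on_mean, off_mean = mean(on_reps), mean(off_reps)
    rel_on = stdev(on_reps) / on_mean     # relative error of the ON state
    rel_off = stdev(off_reps) / off_mean  # relative error of the OFF state
    ratio = on_mean / off_mean
    rel_ratio = math.hypot(rel_on, rel_off)  # sqrt(rel_on**2 + rel_off**2)
    return ratio, ratio * rel_ratio


# Hypothetical fluorescence values from three biological replicates per state.
ratio, error = on_off_with_error([300.0, 400.0, 500.0], [3.0, 4.0, 5.0])
print(ratio, error)
```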

SUPPLEMENTAL  INFORMATION 

Supplemental Information includes Extended Experimental Procedures, six
figures, and seven tables and can be found with this article online at
http://dx.doi.org/10.1016/j.cell.2014.10.002.

Complex Reconfiguration of DNA Nanostructures**

Abstract: Nucleic acids have been used to create diverse synthetic
structural and dynamic systems. Toehold-mediated strand displacement has
enabled the construction of sophisticated circuits, motors, and molecular
computers. Yet it remains challenging to demonstrate complex structural
reconfiguration in which a structure changes from a starting shape to
another arbitrarily prescribed shape. To address this challenge, we have
developed a general structural-reconfiguration method that utilizes the
modularly interconnected architecture of single-stranded DNA tile and
brick structures. The removal of one component strand reveals a newly
exposed toehold on a neighboring strand, thus enabling us to remove
regions of connected component strands without the need to modify the
strands with predesigned external toeholds. By using this method, we
reconfigured a two-dimensional rectangular DNA canvas into diverse
prescribed shapes. We also used this method to reconfigure a
three-dimensional DNA cuboid.

The self-assembly of nucleic acids (DNA and RNA) has produced diverse
synthetic structures. In particular, DNA origami and single-stranded
tiles (SSTs) and bricks[21,22,24,25] have enabled the construction of
megadalton discrete structures with arbitrarily prescribed shapes. In
parallel, researchers have used strand displacement to demonstrate the
construction of dynamic systems, such as switches, walkers,[32,33]
circuits, and triggered assembly systems, which can go through multiple
states of different configurations, either in a directed or in an
autonomous fashion.

Figure 1. Schematic representation of structural reconfiguration from
an SST canvas. a) Strand diagram of strand-displacement-based SST
structural reconfiguration and b) the associated interaction graph. The
strand/node to be displaced is highlighted in red with four domains 1,
2, 3, and 4 complementary to domains 1*, 2*, 3*, and 4*, respectively,
of neighboring strands. When introduced into the system, a full
complementary strand 4*-3*-2*-1* (in salmon) forms a duplex with
the red target component strand to displace the target from the
canvas. c) Strand diagram of strand displacement for a 12 helix
(H) × 10 turns (T) canvas and d) the associated interaction graph.
Strands/nodes highlighted in red on the left depict the subset of
component strands to be displaced. The carved structure is shown on
the right.

Researchers have combined techniques for DNA strand displacement with
methods for DNA structural assembly to create reconfigurable and/or
reversible structures. For example, DNA origami boxes[13] and
clamshells[20] were reconfigured from a closed to an open state through
the toehold-mediated strand displacement of a few component strands in
the structure. More complex reconfiguration methods have placed
single-stranded toeholds at selected sites on the structure for shape
transformation,[37-40] including the formation of a catenane derived
from a DNA-origami Möbius strip and the changing of fractal patterns in
origami structures.[40]

However, it still remains challenging to develop a general framework for
complex structural reconfiguration in which a structure changes from a
particular starting shape to another arbitrarily prescribed shape. To
address this challenge, we have developed a method based on the modularly
interconnected architecture of single-stranded DNA tile and brick
structures. The removal of one component strand reveals a newly exposed
toehold on a neighboring strand, thus enabling us to remove connected
regions of component strands without the need to modify them with
predesigned external toeholds. This method applies only to
SST/brick-based structures and not to origami-based structures because
successful reconfiguration relies on the modular architecture.

By using this method, we demonstrate that a two-dimensional rectangular
DNA SST canvas[21,25] can be reconfigured into many different shapes,
including two full sets of the alphabet (one carved in intaglio, with the
cavity forming the letters, and the other in relief, with DNA forming the
letters). We also show that individual carved pieces can be reassembled
to form the original canvas and be subjected to a second round of
reconfigurations. Finally, this molecular-carving concept was also
applied to a three-dimensional DNA-brick cuboid structure.[22] Overall,
we demonstrate that SST and brick reconfiguration is a robust and modular
method for engineering complex structural reconfiguration, with
resolution at the scale of the component strand (e.g. 3 × 7 nm² for
a typical 42 nt SST[21]).

In an SST structure,[21,25] each strand typically has four binding
domains that are complementary to four different neighboring strands, as
depicted in Figure 1a,c. A self-assembled rectangular SST structure can
be viewed as a "molecular canvas" and depicted as an interaction graph
(Figure 1b,d): each node represents a component strand, and each edge
represents the binding interaction between two strands or nodes. A
component strand (red) can be removed ("carving") by introducing a
"carving strand" (salmon) that is fully complementary to the component
strand (Figure 1a,b). Unlike previous strand-displacement approaches, our
method does not involve the use of an external toehold (the detailed
molecular mechanism is discussed later). By the displacement of multiple
component strands with corresponding carving strands, the canvas can be
reconfigured into a prescribed shape (Figure 1c,d). Moreover, because
each component strand can be removed modularly, it is possible to create
a combinatorially large number of distinct shapes.
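
The interaction-graph view described above can be sketched in a few lines. In this illustration each node is a component strand and each edge is a binding interaction; carving removes nodes and reports which remaining neighbors lose a binding partner (and so carry a newly exposed domain). The square-grid adjacency and all function names are simplifying assumptions made for the example and do not reproduce the actual SST lattice geometry.

```python
from typing import Dict, Iterable, Set, Tuple

Node = Tuple[int, int]


def build_canvas(rows: int, cols: int) -> Dict[Node, Set[Node]]:
    """Build a grid graph in which each strand binds up to four neighbors."""
    graph: Dict[Node, Set[Node]] = {
        (r, c): set() for r in range(rows) for c in range(cols)
    }
    for (r, c) in graph:
        for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
            neighbor = (r + dr, c + dc)
            if neighbor in graph:
                graph[(r, c)].add(neighbor)
    return graph


def carve(graph: Dict[Node, Set[Node]], targets: Iterable[Node]):
    """Remove target strands; return the remaining graph and the exposed neighbors."""
    removed = set(targets)
    exposed: Set[Node] = set()
    for node in removed:
        exposed |= graph[node] - removed  # neighbors that lose a binding partner
    remaining = {
        node: neighbors - removed
        for node, neighbors in graph.items()
        if node not in removed
    }
    return remaining, exposed


canvas = build_canvas(3, 3)
carved, exposed = carve(canvas, [(0, 0)])
print(len(carved), sorted(exposed))
```

Because any subset of nodes can be passed to `carve`, the number of reachable shapes grows combinatorially with canvas size, as noted in the text.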

The canvas[25] was made by the self-assembly of 375 distinct component
strands in 15 mM Mg2+ buffer at 48 °C overnight with 30% yield, as
determined by 2% native gel electrophoresis (see Figure S4 in the
Supporting Information). The structures showed the expected morphology
and dimensions under atomic force microscopy (AFM) imaging (Figure 2a,
bottom left).

To reduce the overall time and potential human errors in the picking and
mixing of carving strands, we wrote a computer program to design target
shapes. The program takes the desired shape as input and then directs a
robotic liquid handler to select the appropriate subset of strands from
a master library of 375 carving strands. These carving strands were then
applied to the canvas solution in an equimolar ratio for overnight
incubation at 35 or 45 °C to produce the carved shapes. Figure 2a shows
an example of carving a corner off the canvas (the detailed carving
mechanism is discussed later). Figure 2b shows AFM images of the carved
shapes of two full sets of the alphabet in intaglio and relief. The yield
of the reconfigured structure in Figure 2a was determined by gel
electrophoresis to be 50% (see Figure S17) and by AFM to be 99% (N = 202;
see Figure S30). Such a high AFM yield is not typical for the more
complex alphabet structures that we carved (see Figures S4 and S5 (gels)
and Figures S6-S15 (AFM images)).
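
A minimal sketch of the shape-to-strand selection step is given below. It is not the authors' program, whose input format and liquid-handler interface are not described here. The text-mask input, the function names, and the short placeholder sequences are all hypothetical; the only element taken from the text is that each carving strand is fully complementary to the component strand it removes.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return sequence.translate(COMPLEMENT)[::-1]


def select_carving_strands(shape_rows, component_library):
    """Pick a carving strand for every canvas position marked '.' (to be removed).

    shape_rows: list of strings, '#' = keep the strand, '.' = carve it away.
    component_library: dict mapping (row, column) to the component sequence.
    """
    picks = {}
    for r, row in enumerate(shape_rows):
        for c, mark in enumerate(row):
            if mark == ".":
                picks[(r, c)] = reverse_complement(component_library[(r, c)])
    return picks


# Hypothetical 2 x 3 canvas with short placeholder sequences (real SSTs are 42 nt).
library = {
    (0, 0): "AACG", (0, 1): "GGTA", (0, 2): "CTTG",
    (1, 0): "TACC", (1, 1): "AGGT", (1, 2): "CCAT",
}
shape = ["##.",
         "#.."]
print(select_carving_strands(shape, library))
```

Inverting the mask (swapping '#' and '.') would switch between the intaglio and relief versions of the same pattern.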

Unlike most previous strand-displacement-based dynamic
systems, our carving scheme does not use predesigned


Figure 2. Alphabet sets reconfigured from a rectangular SST canvas. a) Interaction graphs (top) and AFM images (bottom) of the 24 H × 29 T
canvas used in this study (left) and its reconfiguration into a rectangle with a missing corner (right). Scale bars: 100 nm. b) AFM images of
letters of the alphabet carved in intaglio (top) and relief (bottom). Each image is 150 × 150 nm² in size. See Figures S4 and S5 in the Supporting
Information for the results of agarose gel electrophoresis.

Figure 3. Diagrams and AFM images of a mechanistic study. In each case, interaction graphs with the carving pattern highlighted in red and blue
are shown at the top, and AFM images are shown at the bottom (scale bars: 100 nm). a-d) Carving of pattern 1 without predesigned external
toeholds (a,b) and pattern 1′ with predesigned external toeholds (c,d). e-h) Carving of patterns 2-5. i-l) Experiments demonstrating the
reversibility of carving: reconfiguration of the canvas to give pattern 4, reassembly, and generation of pattern R5 (identical to pattern 5). The
strand diagram and interaction graph boxed by the dashed lines show the reconfiguration mechanism at the zoomed-in fraction of the canvas (in
a, c, e, i, and k). Gray depicts common components; red depicts the strands without external toeholds to be carved; blue marks the presence of
an exposed single-stranded toehold; black indicates the introduction of strands for reassembly of the carved canvas pieces. See the Supporting
Information for a detailed study of carving yields.


external  toeholds  to  initiate  strand  displacement.  We  thus 
conducted  a  set  of  experiments  to  study  the  effects  of  external 
toeholds on carving (Figure 3 a-d, patterns 1 and 1' highlighted
in red). We designed a different canvas in which the
component strands in the carving pattern had external
toeholds (blue; Figure 3 c,d, pattern 1'). After overnight
carving at 45 °C, gel electrophoresis showed minimal carving
for the toehold-free canvas (Figure 3 a,b, pattern 1; yield
around 0%; see Figure S16) but significant carving for
the canvas with external toeholds (Figure 3 c,d; 19-39%


yield; see Figure S16). AFM images were consistent with
the gel results: carving a canvas without predesigned external
toeholds rarely reached completion (ca. 14% yield;
see Figure S27), whereas carving a canvas with external
toeholds led to higher conversion (ca. 81% yield; see
Figure S28). The observation of higher AFM yields than gel
yields suggests that partially carved structures may have
comigrated with the intact canvas on the gel, but may have
broken apart under AFM (e.g. during the deposition or
imaging process).

We next designed a second set of experiments to test five
distinct connection patterns of component strands to be
carved (carving patterns) in a canvas without external
toeholds. The carving samples ([carving strands]/[component
strands]: 1:1, 2:1, or 3:1) from patterns 1-5 were incubated at
45 °C overnight before agarose gel electrophoresis and AFM
imaging. When the sample from carving pattern 1 was
compared with those from patterns 2-5, significantly lower
carving yields were observed for pattern 1 by gel electrophoresis
and AFM (see the Supporting Information for
a detailed yield study).

A component strand to be carved is denoted by a red or
blue node. A blue node indicates the presence of an unpaired
single-stranded domain. Each node in carving pattern 1
(Figure 3 a,b) is disjoint from any other node in the pattern.
In contrast, in each of patterns 2-5 (Figure 3 e-h), the nodes
to be carved are fully connected, such that a path consisting
only of nodes to be carved exists between any two nodes
within the carving pattern. Additionally, each of these
patterns contains at least one blue node representing
a component strand at the end of a constituent helix with
exposed poly-T domain(s) that can serve as an external
toehold (Figure 3 e-h).

The above experiments on the carving patterns demonstrate
that appending an external toehold on each individual
node in a carving pattern composed of disjoint nodes can
enable effective carving (e.g. pattern 1') of an initially
hard-to-carve pattern (e.g. pattern 1). In contrast, effective carving
can be obtained in a fully connected carving pattern (e.g.
patterns 2-5) composed mostly of toehold-free nodes if the
pattern also contains an initial blue node (e.g. the strand with
an exposed poly-T domain at the helix end). In the latter case,
the carving reaction can be initiated from an external toehold
in a blue node, and followed by a cascade of sequential
exposure of initially hidden toeholds. Detailed schematics of
this mechanism can be found in the zoomed-in diagrams of
Figure 3 e,i. For example, in the zoomed-in diagrams of
Figure 3 e, the strand with initial toeholds of poly-T domains
(top, shown as a blue node and as blue domains in the strand
diagram) can be displaced by a carving strand. The displacement
of this strand will reveal a newly exposed toehold on its
neighboring strand (bottom, shown as a blue node and as
a blue domain in the strand diagram), allowing this neighbor
to be displaced next. Following this mechanism, a cascade of
new toeholds will be sequentially generated, enabling the
removal of all strands in the carving pattern. Note that such
a consecutive toehold exposure is not possible in the disjoint
pattern (e.g. pattern 1, see zoom-in of Figure 3 a). In the case
of carving pattern 1', where each node contains a predesigned
external toehold, each component can be displaced independently
in parallel (zoom-in of Figure 3 b).
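
The cascade described above can be read as reachability on the interaction graph of the carving pattern. The following is a minimal sketch under that assumption, not the authors' analysis: strands are graph nodes, an edge joins two neighboring strands within the carving pattern, and a strand becomes removable once it carries a toehold, either predesigned (a blue node) or newly exposed by the removal of a neighbor. The function name `carvable` and the strand labels are hypothetical, and the model is idealized: it ignores leakage and partial yields.

```python
from collections import deque

def carvable(nodes, edges, toehold_nodes):
    """Return the set of strands that an idealized toehold cascade can remove.

    nodes: strands in the carving pattern (hypothetical labels).
    edges: pairs of neighboring strands within the pattern.
    toehold_nodes: strands that start with an exposed toehold (blue nodes).
    """
    adjacency = {n: set() for n in nodes}
    for a, b in edges:
        adjacency[a].add(b)
        adjacency[b].add(a)

    removed = set(toehold_nodes)
    queue = deque(toehold_nodes)
    while queue:
        current = queue.popleft()
        # Removing `current` exposes a toehold on each remaining neighbor.
        for neighbor in adjacency[current]:
            if neighbor not in removed:
                removed.add(neighbor)
                queue.append(neighbor)
    return removed

strands = ["s1", "s2", "s3", "s4"]
chain = [("s1", "s2"), ("s2", "s3"), ("s3", "s4")]

disjoint_no_toehold = carvable(strands, [], [])         # like pattern 1
disjoint_all_toeholds = carvable(strands, [], strands)  # like pattern 1'
connected_one_toehold = carvable(strands, chain, ["s1"])  # like patterns 2-5
```

In this simplified picture a disjoint pattern is carved only where toeholds were placed, whereas a connected pattern needs a single initial toehold to be carved completely.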

In the cases of the intaglio alphabet and the cavities in the
relief set where the carving patterns contain no initial external
toehold, the cascading could be initiated from the transient
dissociation of a few bases on an internal component, or an
unpaired domain from empty neighboring components (such
a defect rate is estimated as 5-10% for a certain component in
a DNA origami structure).[45]

We tested the reversibility of this structural reconfiguration
by adding back the displaced components following the
carving of pattern 4 (Figure 3 j,k, pattern R). This overnight
reaction at 45 °C was sufficient to glue back the two carved-out
pieces almost seamlessly with a yield of around 89% by
gel electrophoresis and 78% by AFM (Figure 3 k; see
Figures S18 and S33). The reassembled canvases were then
subjected to a second round of carving (pattern R5; Figure 3 l),
which resulted in yields similar to those obtained
directly from carving the original canvas (gel yield of 58%
and AFM yield of 72%; Figures S18 and S34).

We then applied this carving method to 3D DNA-brick
cuboids. By using a 10 helix × 10 helix × 80 base-pair structure
(Figure 4 a, top) that we reported previously,[22] we tested
multiple carving patterns. We used a reaction temperature
(28 °C) lower than that for the 2D carving, since 3D structures
contain 8 nt binding domains and are less thermally stable
than the 2D structures, which contain 10 or 11 nt binding
domains. The successful results of carving a corner off
(Figure 4 b, top) or a tunnel through a cuboid (Figure 4 c,
top) and carving the cuboid into two halves (Figure 4 d, top)
were shown by agarose gel electrophoresis (see Figure S37)

Figure 4. Structural reconfiguration from a 3D cuboid. Top: cylinder model of the cuboid (red cylinders denote those to be displaced); bottom:
TEM images (scale bars: 20 nm). a) Cuboid before structural reconfiguration. b) Carving of a corner from the cuboid. c) Carving of a tunnel
through the cuboid. d) Carving of the cuboid into two halves.

and transmission electron microscopy (TEM; Figure 4,
bottom). Because of the limited thermal stability and the
limited  accessibility  of  the  majority  of  the  component  strands, 
the  structural  reconfiguration  of  3D  DNA  structures  is  much 
more  difficult  than  that  of  the  2D  counterparts,  and  it  is  even 
more  challenging  to  obtain  fine  features.  Since  it  is  difficult  to 
find  a  temperature  high  enough  for  fast  strand  displacement 
but  low  enough  (<30°C)  to  maintain  structural  stability,  we 
were  only  able  to  demonstrate  a  few  cases  of  successful 
carving  with  coarse  resolution. 

Carving  may  not  appear  to  be  thermodynamically  favor¬ 
able,  since  a  component  strand  in  the  canvas  structure  that  is 
fully  paired  with  its  neighbors  has  the  same  designed  number 
of  hybridized  bases  as  when  it  is  bound  to  its  fully 
complementary  carving  strand  (after  carving).  However,  the 
carving  off  of  a  component  strand  may  help  to  alleviate  the 
electrostatic  repulsion  that  results  from  the  closely  packed 
neighboring DNA duplexes in the canvas structure[21] and
release  the  mechanical  stress  that  might  be  accrued  at  the 
cross-over  points,!*"1 2  thereby  favoring  the  reaction. 

In  a  similar  way  to  typical  toehold-mediated  strand- 
displacement  circuits,  our  carving  system  shows  leakage 
(carving  can  be  initiated  in  a  connected  region  even  when 
no  external  toehold  is  present),  which  we  utilize  to  initiate 
carving.  However,  it  is  conceivable  that  under  other  reaction 
conditions (e.g. if the carving reaction is carried out at room
temperature over a shorter period of time rather than at an
elevated temperature (35-45 °C) overnight as in this study),
such  leakage  could  be  mitigated,  and  carving  may  only  start  at 
a  site  with  an  external  toehold.  Such  a  system  would  be 
analogous  to  a  strand-displacement  circuit  embedded  in 
a  nanostructure  (the  canvas),  in  which  case  the  strand- 
displacement  cascade  would  be  directly  coupled  with  the 
structural  change  at  single-strand  resolution.  This  system 
would  provide  a  platform  to  explore  the  rich  interplay 
between  structure,  dynamics,  and  computation.  For  example, 
it  would  be  interesting  to  design  spatial  logic  gates  and 
circuits:  A  multi-input  AND  gate  could  be  designed  as 
a  carving  path  with  the  component  strands  along  the  path 
serving  as  the  inputs  and  the  strand  at  the  end  of  the  path 
serving  as  the  output,  or  a  two-input  OR  gate  could  be 
designed  as  two  converging  paths  with  one  strand  on  each 
path  serving  as  the  inputs  and  the  strand  at  the  merging  point 
serving  as  the  output.  In  the  latter  case,  when  multiple  strands 
along  each  path  are  used  as  inputs,  these  two  converging  paths 
would  together  implement  a  disjunctive  normal  form  formula 
([A1 AND A2 AND ... AND An] OR [B1 AND B2 AND ...
AND

Carving DNA nanostructures: The modular, interlocking architecture
of single-stranded tile and brick structures was used to develop
a general method for structural reconfiguration. Removing one
strand (see picture) exposes a new toehold on a neighboring strand,
so that sections of connected strands can be cut out without the
strands having to be modified with predesigned external toeholds.

ABSTRACT Kinetically controlled isothermal growth is fundamental to
biological development, yet it remains challenging to rationally design molecular
systems that self-assemble isothermally into complex geometries via prescribed
assembly and disassembly pathways. By exploiting the programmable chemistry
of base pairing, sophisticated spatial and temporal control have been demonstrated
in DNA self-assembly, but largely as separate pursuits. By integrating temporal with spatial control, here we demonstrate the "developmental"
self-assembly of a DNA tetrahedron, where a prescriptive molecular program orchestrates the kinetic pathways by which DNA molecules isothermally
self-assemble into a well-defined three-dimensional wireframe geometry. In this reaction, nine DNA reactants initially coexist metastably, but upon catalysis by
a DNA initiator molecule, navigate 24 individually characterizable intermediate states via prescribed assembly pathways, organized both in series and in
parallel, to arrive at the tetrahedral final product. In contrast to previous work on dynamic DNA nanotechnology, this developmental program coordinates
growth of ringed substructures into a three-dimensional wireframe superstructure, taking a step toward the goal of kinetically controlled isothermal
growth of complex three-dimensional geometries.

KEYWORDS: molecular programming · DNA nanotechnology · kinetic pathway control · developmental self-assembly


Molecular  self-assembly,  a  funda¬ 
mental  process  underlying  the  de¬ 
velopment  and  operation  of 
biological  organisms,  has  emerged  as  an 
important  engineering  paradigm  for  nano¬ 
technology.  Biological  development  con¬ 
trols  molecular  arrangement  both  spatially 
and  temporally  to  produce  a  complex  or¬ 
ganism  that  robustly  responds  to  its  chemi¬ 
cal  and  physical  environment.  By  contrast,  it 
has  remained  challenging  to  achieve  so¬ 
phisticated  spatial  and  temporal  control  in 
an  integrated  fashion  in  rationally  designed 
synthetic  biomolecular  systems. 

There  have  been  many  recent  advances 
in  the  design  of  sophisticated  synthetic 
nucleic  acid  systems  that  enable  either  spa¬ 
tial  or  temporal  control  of  molecular  self- 
assembly,  but  these  two  capabilities  have 
largely  been  relegated  to  separate  realms. 
Past work in structural DNA nanotechnology,1-3
including the synthesis of ribbons,4,5
tubes,5,6 two- and three-dimensional
extended crystals,6-11 and discrete
objects,12-21 has largely had the goal of
engineering  static  target  structures,  without 


an  explicit  focus  on  controlling  the  assem¬ 
bly  order  and  transient  dynamics  of  how 
individual  units  come  together  to  produce 
such  a  target  structure.  Conversely,  past 
work in dynamic DNA nanotechnology,22
including demonstrations of reconfigurable
devices,24,25 autonomous logical circuits,26-29
dynamic self-assembling systems,
and walkers,28,32-35 has typically focused
on  either  engineering  the  transient  interac¬ 
tion  pattern  of  the  individual  molecular 
species  to  achieve  desired  computational 
or  kinetic  behavior,  without  explicit  concern 
for  the  structural  properties  that  the  assem¬ 
bling  and  disassembling  molecular  species 
produce,  or  has  introduced  limited  reconfi¬ 
gurability  to  an  otherwise  static  structure. 

To capture the complexity and robustness of molecular
self-assembly demonstrated by the biological developmental
process, it is necessary to design the temporal in addition to
the spatial order of self-assembly. This would allow direct,
molecular-scale kinetic control over the entire assembly
pathway, rather than being limited to specification of the
final structure only. Such a system would no longer need a
thermal annealing step to initiate the assembly and encourage the
dominance of the desired, lowest-energy product; the
reaction could instead proceed isothermally and in situ
with a molecular input to trigger the assembly. This
would allow the construction of a "developmental"
self-assembly system where a synthetic structure
"grows" following prescribed kinetic assembly pathways
organized both in series and in parallel, with
some reactions occurring sequentially and others occurring
simultaneously, ultimately developing into the
desired complex target structure.

To address this largely unexplored challenge of
integrating temporal control with spatial control in
synthetic molecular self-assembly, we demonstrate
here the rational design and kinetically controlled
isothermal synthesis of a DNA tetrahedron with a
well-defined three-dimensional wireframe structure.
The assembly is the execution of a developmental
molecular program specified using a reaction graph
abstraction30 that describes the kinetic pathways by
which the DNA reactants self-assemble and disassemble.
This work builds on a previous demonstration of
programming molecular self-assembly and disassembly
pathways using a versatile DNA hairpin motif, to
execute diverse molecular programs including catalytic
formation of branched junctions, cross-catalytic
circuitry, conditional assembly of dendritic structures,
and autonomous locomotion.28 Unlike these previous
demonstrations, the current molecular program yields
a well-defined three-dimensional wireframe structure
formed from closed rings. In contrast to DNA origami
and tile-based approaches to structural DNA nanotechnology,
our method is initiated by a catalytic molecular
trigger, works isothermally within a wide temperature
range, and follows a predetermined kinetic pathway.
We envision that our strategy for rationally designing
developmental molecular programs can be generalized
to more complex three-dimensional wireframe
constructions.

RESULTS  AND  DISCUSSION 

Kinetic Pathway Design. The tetrahedron assembles
from six hairpins30 divided into A and B groups, and
three cooperative hybridization complexes37 making
up the C group (Figure 1a). The hairpins are designed
with structures that initially keep key sequence domains,
called toeholds,34 inaccessible; each hairpin can
then be opened by a specific initiator strand, allowing
the newly accessible toeholds to participate in downstream
reactions. Cooperative hybridization complexes
each bind to two initiators and are used in ring
formation reactions. Thus, the reactants as a group are
metastable, so that no reaction appreciably proceeds
in the absence of the initiator, but in the presence
of the initiator the assembly reaction happens autonomously
and follows prescribed kinetic assembly
pathways without the need for external intervention.
The final product is a three-dimensional tetrahedron
with edges 18 nm long, each containing five turns of
DNA (Figure 1b).
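
As a quick consistency check on the stated edge length, the sketch below converts helical turns to nanometers. It assumes canonical B-form DNA values (about 10.5 base pairs per turn and 0.34 nm rise per base pair), which are textbook approximations rather than parameters taken from this work; the actual edge design may use a different number of base pairs.

```python
# Canonical B-DNA parameters (textbook approximations, assumed here).
BP_PER_TURN = 10.5
RISE_NM_PER_BP = 0.34

def edge_length_nm(turns):
    """Approximate duplex length in nm for a given number of helical turns."""
    return turns * BP_PER_TURN * RISE_NM_PER_BP

five_turn_edge = edge_length_nm(5)  # about 17.85 nm, i.e. roughly 18 nm
```

Under these assumptions five turns correspond to about 17.9 nm, in line with the roughly 18 nm edges quoted above.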

The assembly of the tetrahedron represents the
execution of a prescribed molecular program. The
program is depicted as a reaction graph using a nodal
abstraction that concisely describes the kinetic pathways
by which DNA reactants self-assemble and
disassemble.28 The reaction graph emphasizes the
functional relationship between reactant complexes
rather than the detailed structures of each reactant
(Figure 1c). Each molecular species is represented by a
node depicted as a black ring containing triangles
representing input ports, and circles representing output
ports. The initial state of a port is either 'accessible'
(open symbol) or 'inaccessible' (solid symbol). The
ports are functionally connected through an internal
logic that toggles their states during the execution of
the program, as described below. A developmental
molecular program is written as a reaction graph by
connecting complementary output and input ports on
different nodes via either assembly operations (solid
arrow) or disassembly operations (dashed arrow).

The reaction graph precisely defines which reactions
must occur in series and which reactions may
occur in parallel. For the tetrahedron, the reaction can
only start with the assembly of I to A1, because this is
the only assembly reaction where both participating
ports (the output port on I and the input port on A1)
are initially accessible. The B1 and A2 nodes cannot
assemble with A1 until A1 has been opened (via
assembly with I) and its output ports have become
accessible, but these two assembly reactions are not
dependent on each other because they are on separate
branches downstream of the A1 assembly. This is in
contrast with traditional thermodynamic assemblies,
where all parts of each molecule are initially accessible,
with no explicit control of assembly order (Figure 1c,
right).
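
The series/parallel ordering described above can be sketched as a prerequisite table. This is only an illustration of the idea, not the nodal abstraction itself: the table `PREREQS` and the function `ready` are hypothetical names, only the hairpin steps are included, and the ordering of A3, B2, and B3 is inferred from the sequential hairpin opening along each branch.

```python
# Hypothetical prerequisite table: reactant -> species that must already
# have been incorporated before its input port can react.
PREREQS = {
    "A1": {"I"},   # only I and A1 have initially accessible ports
    "A2": {"A1"},  # A2 and B1 both wait for A1 to open ...
    "B1": {"A1"},  # ... but do not depend on each other
    "A3": {"A2"},
    "B2": {"A2"},
    "B3": {"A3"},
}

def ready(incorporated):
    """List the reactants whose assembly step can fire next."""
    return sorted(
        reactant
        for reactant, needed in PREREQS.items()
        if reactant not in incorporated and needed <= incorporated
    )

step0 = ready(set())        # nothing can react without the initiator
step1 = ready({"I"})        # only A1 can assemble
step2 = ready({"I", "A1"})  # A2 and B1 may now proceed in parallel
```

Steps that appear together in one `ready` result are independent of each other, which is the sense in which the reaction graph allows parallel branches.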

The ports each correspond to a physical region on a
strand or complex that has a single function during the
assembly process. For hairpins, the single input region
is represented by an orange triangle, and the two
output regions are colored according to their position;
the blue ports overlap the loops, while the green ports
overlap the tails. The nodal abstraction is capable of
representing the state of the molecular program at
various points in its execution: in a hairpin assembly
reaction (Figure 1d, top), the hairpin input port and the
corresponding initiator output port are both initially
open, indicating that a reaction is possible. After
assembly, a line is drawn between these ports indicating
that they are now bound, and, importantly, the two
output ports on the hairpin change state from closed
to open. This corresponds to the structural change,
where the 1* toehold on the single-stranded initiator

Figure 1. Catalytic self-assembly of a DNA tetrahedron. (a) Overview of the reaction. (b) A computer-rendered model16 of the
tertiary structure of the assembled tetrahedron. (c) The reaction graph of the developmental molecular program specifying
kinetically controlled self-assembly (left) compared to a traditional self-assembly process that lacks pathway control (right).
Solid and dashed arrows depict kinetically controlled assembly and disassembly operations; line segments depict assembly
operations that are not kinetically controlled. (d) Execution schematics of two elementary reactions. Left, molecular
structures; right, corresponding nodal abstractions, where lines are added to connect ports once assembly has occurred.
The strand regions are colored the same as the corresponding ports in the nodal representation. Top, a hairpin assembly
reaction. Bottom, a cooperative assembly reaction. (e) Execution of the developmental molecular program along one possible
assembly pathway. Only active toehold domains are labeled in this figure, with newly hybridized toeholds labeled in pink.
Top, molecular structures; bottom, corresponding nodal execution schematics. See Supporting Information Figure S1 for a
schematic showing all sequence domains. (f) The full set of intermediates along the prescribed assembly pathways. A three-letter
code is used to identify each species as explained in the text. Species that are structurally congruent are linked by gray
boxes. The numbers of assembled strands (excluding the initiator) for each row are displayed. The pathway in panel e
corresponds to the reactions along the left edge of this figure.


hybridizes  with  the  exposed  1  toehold  on  the  hairpin, 
beginning  a  strand  displacement  that  opens  the  hair¬ 
pin  and  exposes  the  two  initially  hidden  toeholds  2* 
and  A*.  A  disassembly  reaction28  (not  shown)  is  repre¬ 
sented  by  a  dashed  arrow,  and  represents  the  displa¬ 
cement  of  a  previously  assembled  port  with  a  new  one. 


This  type  of  reaction  is  used  in  the  tetrahedron  to  displace 
the  initiator  partway  through  the  assembly,  allowing  it  to 
catalyze  the  assembly  of  further  tetrahedra. 

Cooperative complexes,37 rather than hairpins,
were used for the ring forming reactions because they
were observed to reduce the formation of aggregates
and multimeric products, which are the result of
intermolecular interactions outcompeting the desired
intramolecular ones (see Supporting Information
Section S3.3). Cooperative hybridization complexes
contain two toeholds and are assembled into the growing
complex only in the presence of both initiator domains
(Figure 1d, bottom). This is because the singly bound
intermediate is short-lived due to the fact that neither
initiator domain can fully displace the protector strand
by itself, leading to quick disassembly if the second
initiator is not immediately available.37 In molecular
terms, the depicted cooperative complex has two
toeholds, 7 and 10. Upon binding to the exposed 7*
toehold in the blue initiator, the protector strand is
partially displaced. This initial reaction is by itself
reversible; in the absence of the green initiator, the
blue initiator will rapidly disassemble. However, if the
green initiator with toehold 10 is also present, the 10
and 10* toeholds may then bind, allowing the protector
strand to be completely and irreversibly displaced.
In the tetrahedron, all cooperative assembly events are
intended to occur intramolecularly, involving two initiator
domains on the same molecule, and thus, each
such assembly causes the formation of a ring. See
Supporting Information Section S4 for additional details
on the nodal abstraction and reaction graphs.

Figure 1e depicts an example execution trajectory
for the molecular program. The top panel shows the
molecular structures and the bottom panel shows the
corresponding state of the molecular program using
the nodal abstraction. The assembly begins with the
sequential opening of the A1, A2, and A3 hairpins
causing self-assembly of a three-arm junction30 corresponding
to the first vertex of the tetrahedron
(reactions 1-3). The initiator I is released upon completion
of the junction (reaction 3), allowing it to
catalyze the assembly of further tetrahedra. Next, each
arm of this junction extends to incorporate the
B1, B2, and B3 reactants through another set of hairpin
opening reactions (reactions 4-6). Finally, the overhanging
portions of the B strands are involved in three
ring forming reactions using a set of cooperative
hybridization complexes37 incorporating the C1, C2,
and C3 strands (reactions 7-9).

Because the assembly was designed to proceed in a
branched fashion, the assembly of the three independent
branches is not synchronized. Thus, there are
multiple potential paths leading to the desired product,
involving a total of 24 on-pathway intermediates
(Figure 1f). We identify each intermediate of the tetrahedron
formation with a three-letter code where each
letter denotes the progress of one of the three
branches of assembly. For example, in the intermediate
ABA, A1 has been incorporated in the first branch, A2
and B2 are in the second branch, and A3 is in the third
branch (note again that A2 incorporation is a prerequisite
of B2 incorporation). Letter X is used to denote
the lack of assembly of the A strand along the corresponding
branch; if the third letter is X, the initiator I is
still attached (for example, AAX contains A1, A2, and I).
See Supporting Information Figure S3 for a detailed
description of the codes. Note that complexes that are
related by cyclic permutations of their abbreviations
have congruent secondary structures; these are linked
by gray boxes in the figure.
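
The three-letter code lends itself to a short enumeration. The sketch below is one reading of the rules in the text, not the authors' procedure. It assumes that (i) the A hairpins open strictly in the order A1, A2, A3, so no branch can have progressed if the preceding branch is still X; (ii) a letter C on branch i means that a ring has closed between branch i and the next branch (cyclically), which requires the B strand of that next branch to be present; and (iii) the completed tetrahedron CCC is the product rather than an intermediate. Assumption (ii) is inferred from example transitions such as BBX to CBX reported for this system. The function names are hypothetical.

```python
from itertools import product

LEVELS = "XABC"  # per-branch progress: none, A hairpin, A+B hairpins, ring closed
STRANDS_PER_LEVEL = {"X": 0, "A": 1, "B": 2, "C": 3}

def is_valid(code):
    """Check a three-letter code against the assumed assembly rules."""
    n = len(code)
    if code[0] == "X":
        return False  # A1 must be incorporated in every intermediate
    for i in range(1, n):
        # Sequential hairpin opening: branch i needs branch i-1 started.
        if code[i] != "X" and code[i - 1] == "X":
            return False
    for i, letter in enumerate(code):
        # Assumed ring rule: closing ring i needs the B strand of branch i+1.
        if letter == "C" and code[(i + 1) % n] not in "BC":
            return False
    return True

def intermediates():
    """Enumerate all on-pathway intermediates, excluding the product CCC."""
    codes = ("".join(c) for c in product(LEVELS, repeat=3))
    return [c for c in codes if is_valid(c) and c != "CCC"]

def strands_incorporated(code):
    """Number of assembled strands, excluding the initiator."""
    return sum(STRANDS_PER_LEVEL[letter] for letter in code)

codes = intermediates()
one_ring = [c for c in codes if c.count("C") == 1]
```

Under these assumptions the enumeration yields 24 intermediates, matching the count stated above; the ring rule itself remains an inference and should be checked against Supporting Information Figure S3.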

Gel Electrophoresis Studies. We studied the formation
of the tetrahedron using a gel electrophoresis mobility
shift assay (Figure 2), where each intermediate was
individually synthesized by mixing the initiator with
different subsets of the nine reactants of the tetrahedron
assembly. We were able to observe the formation
of each of the 24 possible intermediates as a distinct
band in a native polyacrylamide gel (lanes 2-25).
Intermediates with greater numbers of incorporated
strands migrated more slowly, with the exception of
the ring formation transitions, which tended to have
unpredictable effects (e.g., BBX → CBX and BBA → CBA
showed little mobility change, and BBB → CBB showed
increased mobility). This is likely because the first ring
formation reduces the angle between two of the arms
from its natural angle to ~60°, making the complex
more compact. Groups of intermediates with congruent
structures, linked by the gray boxes in the figure,
were observed to have nearly identical mobility, consistent
with our expectations. The band identified as
the assembled tetrahedron (lane 26) had a mobility
distinct from any of the intermediates. The lower-mobility
bands above the tetrahedron band are hypothesized
to be multimers of the tetrahedron structure,
formed when assembly of the cooperative
complexes causes the intermolecular joining of two
different growing tetrahedra rather than the desired
intramolecular ring forming reactions (Supporting
Information Figure S4).

When all reactants except I were incubated together
(lane 30), only a small amount of unintended
"leakage" products was observed: 3.5% of the signal
was in the product band and 80% was in the reactants
band. When at least 0.25 equiv of I was included (lanes
26-28), the reaction proceeded to completion (that is,
all A1 hairpins were consumed), indicating a catalytic
turnover of at least 4. Typical yields of the tetrahedron
varied between 20 and 40%, depending on concentration
(Supporting Information Figure S5). At a higher
reactant concentration (100 nM, lanes 1-30), the
dominant side products were the higher-weight products,
while at a lower concentration (10 nM, lanes
31-35), the dominant side products were intermediates
of lower molecular weight. Varying the toehold
lengths of the cooperative complexes was not observed
to qualitatively improve the reaction yield
(Supporting Information Figure S6).

We observed that the reaction yield was primarily
bottlenecked by the ring-forming reactions. On the

Figure 2. Characterization of the tetrahedron assembly pathway. This is a gel electrophoresis mobility shift assay where lanes
1-25 show all on-pathway intermediates of the tetrahedron assembly, formed by mixing different subsets of reactants with
the initiator. Gray boxes mark groups of intermediates that are structurally congruent and expected to have the same
mobility, and the structures of the intended products are shown, as in Figure 1f. Lanes 26-35 show the analysis of catalytic
turnover at two concentrations, showing reactions containing all nine reactants but varying concentrations of initiator I.
These are 6% native polyacrylamide gels of assembly reactions containing 1 equiv of all reactants at the specified
concentration, except A1 for which we used 0.9 equiv of a FAM fluorophore-labeled hairpin to observe incorporation yields.
The initiator was included at 1 equiv unless otherwise specified in the figure. The assembly reactions were conducted at room
temperature in TAE/Mg2+ buffer containing 12.5 mM Mg2+ over 19 h. The dotted line separates two gel slabs that were run
simultaneously, and the solid line separates gels that were run at different times. The intensity of the FAM fluorescent label is
shown in red, and SYBR Gold staining intensity is shown in blue; these channels are separated in Supporting Information
Figure S2. See Supporting Information Figure S4 for an agarose gel of these same samples, in which the side products of
higher molecular weight are well-resolved.


basis of analysis of Figure 2, the yield of BBB, the largest
intermediate with no rings, was 83% for six hairpin
incorporation events. However, the average yield of
the seven intermediates containing one C strand, and
thus one formed ring, dropped to 71%. With two C
strands, the yield was reduced to 50%, and the yield of
the full tetrahedron was only 19%. This drop in yield is
likely the result of undesired intermolecular interactions
outcompeting the desired intramolecular ring
formation during the assembly of the cooperative
complexes. This may be exacerbated by steric strain
introduced in the ring formations. Although the
single-nucleotide spacers and nicks at each junction were
meant to relieve this strain, it is not clear if this
was sufficient, and the use of longer single-stranded
spacers at the vertices might further improve the yield.
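To make the bottleneck concrete, the reported yields can be converted into approximate per-step figures. The sketch below assumes that the six hairpin incorporations are equally efficient and that each ring-closing stage can be treated as conditional on the previous one. Both assumptions are ours, since the yields come from separate reactions with different reactant subsets.

```python
def per_step_yield(overall: float, steps: int) -> float:
    """Geometric-mean yield per step, assuming equally efficient steps."""
    return overall ** (1.0 / steps)


def stage_ratios(yields):
    """Ratio of each cumulative yield to the one before it."""
    return [later / earlier for earlier, later in zip(yields, yields[1:])]


# Cumulative yields reported in the text: no rings, then one, two, three rings.
reported = [0.83, 0.71, 0.50, 0.19]

print(round(per_step_yield(reported[0], 6), 3))       # per hairpin incorporation
print([round(r, 2) for r in stage_ratios(reported)])  # per ring-closing stage
```

Under these assumptions each hairpin incorporation proceeds at roughly 97%, whereas the apparent efficiency of successive ring closures falls from about 86% to 70% to 38%, consistent with ring formation being the limiting step.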

Gel  electrophoresis  studies  showed  that  the  as¬ 
sembly  is  temperature-robust,  working  isothermally 
at  temperatures  in  roughly  the  range  16-41  °C 
(Supporting  Information  Figure  S7).  At  room  tempera¬ 
ture  at  100  nM,  the  assembly  reaction  was  observed 
to  be  complete  after  9  h  (Supporting  Information 
Figure S8).

While the unique mobility of the CCC band indicates
that all nine strands are incorporated into a single
complex, additional assays are required to demonstrate
ring formation and distinguish it from, for
example, a large floppy three-arm junction. To
demonstrate the ring formation, we conducted a
fluorescence-quenching assay (Figure 3), using C
strands functionalized with a fluorophore and optionally
a quencher. The formation of each ring caused the
fluorophore from one C molecule to become colocalized
with the quencher on the neighboring strand,
quenching the fluorescent signal. For example, lane 1
contained a CBB complex where a red fluorophore was
attached to the C1 strand; as expected, a red fluorescent
band was visible on the gel. Lane 2 contained a
CCB complex where red and green fluorophores were,
respectively, attached to the C1 and C2 strands; both
red and green fluorescence were detected in the target
band, as expected. Lane 3 showed another CCB complex
which additionally had a quencher attached to the
C2 strand; here, green but not red fluorescence signal
was detected, indicating that proper ring formation
had resulted in the colocalization of the quencher on
C2 with the red fluorophore on C1, thus removing the
red fluorescence from the product band. In the full
tetrahedron, all three fluorophores were fully visible in
the absence of quenchers (lane 10) but fully quenched
when quenchers were added (lane 11), indicating that
all three ring formations were simultaneously successful,
and that the tetrahedron's overall structure formed
as designed.

Microscopy Studies. We further confirmed the correct
formation of the geometric structures of the tetrahedron
and its key intermediates using atomic force
microscopy (AFM). BBB appeared as a three-arm
Figure 3. Characterization of ring-forming reactions. In this fluorescence-quenching assay, each of the three C strands was
functionalized with a different fluorophore at its 5' end (C1-TYE665, red; C2-TAMRA, green; C3-FAM, blue); F represents a C
strand with a fluorophore only, while Q represents a strand with a fluorophore expected to be quenched by a quencher on the
3'-end of a neighboring C strand. Lanes 1-3 show the structures FBB (CBB with a red fluorophore on the C1 strand), FFB (CCB
with red and green fluorophores on the C1 and C2 strands, respectively), and QFB (CCB with the same two fluorophores, plus a
3'-quencher on the C2 strand that quenches the red 5'-fluorophore on the C1 strand). Lanes 4-6 and 7-9 show the other two
structural permutations. Lane 10 shows FFF (the full tetrahedron with all fluorophores), and lane 11 shows QQQ (the full
tetrahedron with all fluorophores and all quenchers). Lanes 12 and 13 are the same lanes in the same gel as lanes 10 and 11,
but after staining with SYBR Gold. This is a 6% native polyacrylamide gel of a 10 nM assembly reaction with 1 equiv of initiator,
conducted at room temperature in TAE/Mg2+ buffer containing 12.5 mM Mg2+ over 22 h.


junction, CBB as a single triangle, CCB as a double
triangle, and CCC as a flattened tetrahedron, all consistent
with their designed shapes (Figure 4a). Additionally,
digestion at an intentionally engineered endonuclease
restriction site on a specific edge converted a
full tetrahedron back to a double triangle pattern
under AFM, as expected. The double triangle structure
contained two noncongruent types of vertices, which
were distinguished by attaching a streptavidin molecule
to one of them; digestion with either of two
endonucleases targeted to two different edges yielded
images of structures with the streptavidin on the
expected vertex, further confirming the correct formation
of the full tetrahedron (Figure 4b).

CONCLUSIONS 

We have extended the pathway-controlled DNA
self-assembly approach,2* demonstrating the developmental
growth of a well-defined three-dimensional
wireframe DNA tetrahedron, formed from metastable
reactants conditionally in the presence of a
single-stranded trigger and through a kinetically programmed
pathway. In particular, this is the first catalytic
self-assembly design that incorporates ring-forming
events, a potentially useful topological primitive for the
growth of larger assemblies with well-defined geometries.
Ring formation is challenging because the desired
intramolecular reaction that forms the ring must compete
with off-pathway intermolecular reactions that
lead to multimers and other aggregation side products.
Using cooperative hybridization complexes for
the ring-forming reactions appears to avoid significant
difficulties encountered with a set of noncooperative
designs (Supporting Information Section S3.3). We
believe that even greater improvements in yield could
be achieved by designing the geometry of ring-forming
steps so that they do not require such large
conformational changes in the molecule. This would
favor the desired intramolecular reaction both thermodynamically
and kinetically, because the reaction
would induce less strain and would have an increased
attempt frequency.

This work represents an advance over previous
attempts to control assembly order. One previous
approach imposes assembly order by designing different
parts of the structure to have distinct melting or
formation temperatures, usually by using binding domains
of different lengths. The assembly events thus
occur in a particular order as the temperature is
lowered during the annealing process. This has been
used a number of times,3ar3a including with some
polyhedra.40,41 However, such approaches typically
require external modulation of the assembly environment
in the form of an annealing ramp, coupling the
assembly order to bulk environmental conditions such
as temperature. Also, while other approaches have
recently been demonstrated that work isothermally
within narrow ranges of temperatures,20 no explicit
assembly order control is designed in these systems. In
contrast, our approach allows direct, molecular-level
control of assembly order through complete pathway
engineering, is independent of bulk properties like
temperature, and operates isothermally over a wide
range of temperatures that includes biologically relevant
conditions.

Another route for direct isothermal molecular control
of assembly order is the algorithmic self-assembly of
static DNA tiles. Here, in a seeded growth system,
Figure 4. Atomic force microscopy images of tetrahedron self-assembly. (a) Images of purified samples of the intermediates
BBB, CBB, CCB, and the full tetrahedron CCC, which, respectively, appeared as a three-arm junction, a single triangle, a double
triangle, and a three- or four-lobed structure corresponding to a flattened tetrahedron, each consistent with our design. In
each image, the double-stranded edges of each wireframe structure are clearly resolved. (b) Images of the full tetrahedron cut
by restriction endonucleases followed by incubation with streptavidin. Digestion with either of two endonucleases targeted
to different edges restored the double-triangle structures, with the streptavidin (appearing as small white circular features in
the AFM image) appearing at the expected biotin-modified vertices. In the schematic diagrams, the black bar intersecting a
tetrahedron edge represents the designed restriction site, the black dot is the 5'-biotin, and the yellow circle is the
streptavidin. The scale bars of the larger images are 100 nm long; the inset images are 62.5 nm in width, at double the scale of
the larger images. These samples were assembled at a concentration of 100 nM and then purified by glycerol gradient
ultracentrifugation; for panel b, the restriction and streptavidin binding were performed before purification.


tiles are incorporated sequentially at particular positions,
as directed by cooperative binding to neighboring
tiles on the growth front. However, this tile-based
approach is dependent upon the supersaturated nature
of the tile species, and is limited to the specific geometry
of tile-based lattices with a growth front. More broadly,
this tile-based approach involves "static" monomers
incorporated by simple binding to the growing structure
without explicitly designed internal state changes. In
contrast to "passive" assembly using static tiles, our
"active" developmental self-assembly uses reconfigurable
hairpins as assembling monomers that are capable of
conditional configurational change and can implement
both assembly and disassembly. An analytical
model has found that methods using active components
can in principle be exponentially faster than
the corresponding passive methods.45 Developmental
assembly with active components provides a more
versatile, expressive molecular programming language
for integrating temporal and spatial control, and a
potentially more efficient method for the active construction
of molecular structures.

Kinetically controlled developmental self-assembly
mimics the well-orchestrated nature of biological reaction
cascades, which operate autonomously and without
the need for external intervention to maintain their
function. The triggered, isothermal nature of our assembly
methodology would allow the formation of
complex structures to be integrated with nucleic acid
computational circuits, allowing the use of structural
changes as an output mode of logical computation.
These could furthermore be interfaced with non-nucleic
acid inputs such as proteins and small molecules
through the use of aptamers,30 which would allow
these systems to interact with the larger chemical
world and respond in potentially intricate ways to their
molecular environment. This methodology is additionally
expected to be more amenable to in vivo
applications, where thermal annealing is not an available
mode of assembly. Thus, synthetic developmental


EXPERIMENTAL  METHODS 

Sequence Design. Motif design was performed by hand using
principles described previously26 and in Supporting Information
Section S3.1. Sequence design was performed with
Multisubjective (version 1.0.2), a program that identifies and
eliminates spurious hybridizations in the full hairpin system
(manuscript in preparation), in association with Domain Design
(DD) (version 0.2), which uses simple heuristics to evaluate the
acceptability of candidate sequence domains.44 A local copy of
NUPACK (version 3.0) was used by Multisubjective to generate
base-pairing probabilities for further analysis.45,46 See Supporting
Information Section S3.2 for more sequence design details.
See Supporting Information Section S5 for designed sequences.

Strand Synthesis. DNA strands were synthesized and purified
by Integrated DNA Technologies (IDT), including strands containing
phosphates and fluorophores. For the fluorescent C
strands used in the fluorescence-quenching assay only, we
ordered the strands in two halves, ligated them using 27 U/μL
of T4 DNA ligase at a DNA concentration of 27 at 16 °C for
2 h, and purified the product by denaturing polyacrylamide gel
electrophoresis. In all cases, we quantitated the concentrations
of DNA stock solutions by measuring the ultraviolet light absorption
at 260 nm with the microvolume pedestal of a NanoDrop
2000c spectrophotometer, taking the average of measurements of
three samples for each strand, and using extinction coefficients
provided by IDT to calculate the concentration of each strand.
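The concentration calculation described above amounts to the Beer-Lambert law, c = A260 / (ε · l), applied to the mean of the replicate readings. The following is a minimal sketch. The readings and the extinction coefficient are hypothetical placeholders, and we assume the absorbance is reported normalized to a 1 cm path length.

```python
from statistics import mean


def strand_concentration_uM(a260_readings, epsilon_per_M_cm, path_cm=1.0):
    """Beer-Lambert estimate of strand concentration in micromolar.

    a260_readings: replicate absorbance values at 260 nm
    epsilon_per_M_cm: molar extinction coefficient in L/(mol*cm)
    path_cm: optical path length (assumed normalized to 1 cm)
    """
    if not a260_readings:
        raise ValueError("at least one absorbance reading is required")
    a260 = mean(a260_readings)
    molar = a260 / (epsilon_per_M_cm * path_cm)
    return molar * 1e6  # mol/L -> micromol/L


# Hypothetical example: three readings and a made-up extinction coefficient.
readings = [0.50, 0.52, 0.48]
print(round(strand_concentration_uM(readings, 500_000.0), 3))
```

Averaging before converting is equivalent to averaging the three computed concentrations, because the conversion is linear in the absorbance.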

Sample Preparation. The hairpin strands were separately
heated to 95 °C for 5 min and allowed to cool to room temperature
over about 20 min. Cooperative hybridization complexes
were annealed in the same way, except that 1.5 equiv of
the Cp strands was combined with 1 equiv of the C strand
before annealing. The desired reactants were then combined
to a final concentration of 100 nM (Figure 2, lanes 1-30 and
Figure 4) or 10 nM (Figure 2, lanes 31-35 and Figure 3) of each
reactant except A1, for which 0.9 equiv was used instead in
order to obviate the effects of inaccuracies in stoichiometry and
assist in calculation of the yield. The assembly reactions were
performed at room temperature over about 20 h in TAE/Mg2+
buffer containing 40 mM Tris base, 20 mM acetic acid, 1 2.7 mM
EDTA, and 12.5 mM MgCl2.

For the formation assay (Figure 2), an A1 strand incorporating
a fluorescent label was used instead of the regular A1
strand. For the fluorescence-quenching assay (Figure 3), the C
strands each had a different fluorophore on one end and an
optional quencher on the other, as described in the figure
caption. For the CCC samples in the atomic force microscopy
studies (Figure 4a), we used C strands that contained phosphates
on their 5'-ends; this was not expected to affect the
assembly. For the streptavidin labeling experiment (Figure 4b),
we used an A2 strand containing a 5' biotin.

For the atomic force microscope studies, the samples were
purified by ultracentrifugation in a 15-45% glycerol gradient in
1x TAE/Mg2+ for 3 h at 50000 rpm. Fractions of 100 were
taken from the centrifuge tube, and the fractions containing the
desired product were identified by native gel electrophoresis.
The glycerol was not removed after the purifications, meaning
that subsequent manipulations were performed in buffer containing
about 20% glycerol.

Gel Electrophoresis. The gels in Figures 2 and 3 were precast
6% native polyacrylamide gels run at 100 V for 55 min at 21 °C
using 1x TBE running buffer. For each lane, 4 μL of the reaction
mixture was mixed with an equal amount of 2x native loading
buffer containing a small amount of bromophenol blue in 2x
TAE/Mg2+ buffer and 10% glycerol. Then, 4 μL of this mixture
containing 0.2 pmol of the specified complex was loaded
into the gel.
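The loaded amount can be checked from the volumes and concentrations given. The short sketch below (function and parameter names are ours) reproduces the 0.2 pmol figure for the 100 nM reactions; by the same arithmetic, a 10 nM reaction would deliver one-tenth of that amount.

```python
def pmol_loaded(reaction_nM, reaction_uL, buffer_uL, loaded_uL):
    """Amount of complex (pmol) loaded into a gel lane after mixing the
    reaction with loading buffer and loading part of the mixture."""
    # nM * uL = fmol; multiply by 1e-3 to convert to pmol
    total_pmol = reaction_nM * reaction_uL * 1e-3
    fraction_loaded = loaded_uL / (reaction_uL + buffer_uL)
    return total_pmol * fraction_loaded


# 4 uL of a 100 nM reaction + 4 uL of 2x loading buffer, 4 uL loaded
print(round(pmol_loaded(100.0, 4.0, 4.0, 4.0), 3))
```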

The gels were imaged with a Typhoon FLA 9000 gel scanner.
The gels in Figure 2 and the right side inset of Figure 3 were
stained in SYBR Gold for about 20 min before imaging; all other
gels were unstained. For Figure 2, we imaged the gel unstained
using the fluorescent label, and then stained it and reimaged
the same gel. The two images were then manually overlaid.

We quantitated the yield by dividing the fluorescence
intensity of the desired product by the intensity of the entire
lane in the fluorescently labeled channel (shown as red) in
Figure 2, using ImageQuant TL. We used automatic band
detection for the bands representing the desired product and
the reactants band if it was present. We then defined multimers
as the lane area with lower mobility than the desired product
band, and intermediates as the lane area between the product
and reactants band. For background subtraction, we used the
rolling ball method with a radius of 500.
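The yield definition above reduces to a ratio of integrated intensities. The sketch below illustrates only that final step and does not reproduce ImageQuant TL's band detection or rolling-ball background subtraction. The region names follow the text, but the intensity values are hypothetical, chosen to match the 3.5% product and 80% reactant fractions reported for the leakage lane.

```python
def lane_fractions(regions):
    """Fraction of total lane signal in each region.

    regions: mapping of region name -> background-subtracted
    integrated fluorescence intensity (arbitrary units).
    """
    total = sum(regions.values())
    if total <= 0:
        raise ValueError("total lane intensity must be positive")
    return {name: value / total for name, value in regions.items()}


# Hypothetical intensities for a lane without initiator.
lane = {
    "multimers": 65.0,       # lower mobility than the product band
    "product": 35.0,
    "intermediates": 100.0,  # between product and reactants bands
    "reactants": 800.0,
}
fractions = lane_fractions(lane)
print(round(fractions["product"], 3), round(fractions["reactants"], 3))
```

Normalizing by the whole lane rather than by an external standard makes the yield insensitive to lane-to-lane differences in loading.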

Atomic Force Microscopy. The samples in Figure 4a were imaged
directly after glycerol purification. For the streptavidin
labeling experiment (Figure 4b), the appropriate digestion
buffer was added after assembly to a final concentration of
1x, and the structures were digested with one of the two
restriction endonucleases BstZ17I or ScaI-HF at a concentration
of 1 U/μL at 37 °C for 2 h. The structures were then incubated
with a 10x solution of streptavidin for 30 min and then glycerol
purified.

For imaging, we added to a freshly cleaved mica surface
30 μL of filtered 5x TAE/Mg2+ buffer, followed by 30 μL of a
10 mM solution of NiCl2 to increase the strength of the
DNA-mica binding. After 5 min, we added 10-30 μL of the
glycerol-purification fraction containing our desired product.
AFM images were obtained using a Multimode 8 scanning
probe microscope with a Digital Instruments Nanoscope V
controller. Images were collected in aqueous phase using
tapping mode, using the short and thin cantilevers in the
SNL-10 silicon nitride cantilever chip. Within each of the two
panels of Figure 4, the images were all generated using the
same tip.
Multiplexed 3D cellular super-resolution imaging
with DNA-PAINT and Exchange-PAINT

Ralf Jungmann1,2,6, Maier S Avendaño1,2,6, Johannes B Woehrstein1,6, Mingjie Dai1,3, William M Shih1,4,5 & Peng Yin1,2


Super-resolution fluorescence microscopy is a powerful tool
for biological research, but obtaining multiplexed images for
a large number of distinct target species remains challenging.
Here we use the transient binding of short fluorescently labeled
oligonucleotides (DNA-PAINT, a variation of point accumulation
for imaging in nanoscale topography) for simple and
easy-to-implement multiplexed super-resolution imaging that
achieves sub-10-nm spatial resolution in vitro on synthetic
DNA structures. We also report a multiplexing approach
(Exchange-PAINT) that allows sequential imaging of multiple
targets using only a single dye and a single laser source. We
experimentally demonstrate ten-color super-resolution imaging
in vitro on synthetic DNA structures as well as four-color
two-dimensional (2D) imaging and three-color 3D imaging of
proteins in fixed cells.

Far-field fluorescence microscopy has undergone major advances
since the advent of methods circumventing the classical diffraction
limit, i.e., super-resolution microscopy. Most implementations
'switch' molecules between fluorescence on- and off-states to
obtain subdiffraction image resolution. This switching is traditionally
obtained in two ways: targeted switching actively confines the
fluorescence excitation to an area smaller than the diffraction limit
of light (for example, in stimulated emission depletion, or STED,
microscopy3), whereas stochastic switching uses photoswitchable
proteins or photoswitchable organic dyes (for example, in
photoactivated localization microscopy (PALM)4 and stochastic
optical reconstruction microscopy (STORM)1). Although these
methods offer unprecedented spatial resolution, they tend to be
technically involved to implement, and multiplexing for a large
number of distinct targets is generally challenging.

PAINT5-7 provides an alternative stochastic super-resolution
imaging method. Here imaging is carried out using diffusing
fluorescent molecules that interact transiently with the sample.
This method is straightforward to implement and does not require
specialized equipment or conditions to obtain photoswitching,
thus making it more accessible than STED or STORM for laboratories
with standard instrumentation and sample-preparation
capabilities. Initially, PAINT was applied to obtain super-resolved
images of cell membranes5 and artificial lipid vesicles5. However,
a key limitation of PAINT's original formulation is that dyes interact
with the sample via electrostatic coupling or hydrophobic
interactions. This limits the availability of PAINT-compatible
dyes, making it hard to simultaneously image specific biomolecules
of interest. A recent implementation of PAINT has involved
continuously and stochastically labeling specific membrane
biomolecules with fluorescent ligands (such as antibodies)6.
The approach, termed universal PAINT, achieves specific
dye-sample interactions but still lacks the ability to specify interactions
with programmable kinetics. Similarly to PAINT, binding
of DNA intercalating dyes has also been used to obtain
super-resolved images of DNA8,9.

To achieve programmable dye interactions and to increase the
specificity and the number of usable fluorophores, DNA-PAINT
was developed10. Here stochastic switching between fluorescence
on- and off-states is implemented via repetitive, transient binding
of fluorescently labeled oligonucleotides ('imager' strands)
to complementary 'docking' strands on DNA nanostructures
(Fig. 1a,b). In the unbound state, only background fluorescence
from partially quenched10 imager strands is observed (Fig. 1a).
However, upon binding and immobilization of an imager strand,
fluorescence emission is detected using total-internal-reflection
(TIR) or highly inclined and laminated optical sheet (HILO)
microscopy11. DNA-PAINT enhances PAINT's simplicity and
ease of use with the programmability and specificity of DNA
hybridization. Notably, it enables a wide range of fluorescence
on- and off-times; these can be adjusted by tuning the
binding strength and concentration of the imager strand10.
DNA-PAINT has been used to obtain multicolor subdiffraction
images of DNA nanostructures10,12-15 with ~25-nm spatial
resolution14. Spectral multiplexing is straightforward as no
external photoswitching of dyes is necessary, and imaging
specificity is obtained through orthogonality of DNA sequences
coupled to spectrally distinct dyes13.
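The dependence of on- and off-times on binding strength and imager concentration can be illustrated with a simple two-state binding model, in which the mean bound (on) time is 1/k_off and the mean unbound (off) time is 1/(k_on · c). This model and all rate constants below are assumptions for illustration and are not values taken from this section.

```python
def mean_bound_time_s(k_off_per_s):
    """Mean fluorescence on-time for a single docking site (seconds)."""
    return 1.0 / k_off_per_s


def mean_dark_time_s(k_on_per_M_s, imager_conc_M):
    """Mean fluorescence off-time between binding events (seconds)."""
    return 1.0 / (k_on_per_M_s * imager_conc_M)


# Hypothetical rate constants and imager concentration.
k_on = 1e6      # association rate, 1/(M*s)
k_off = 2.0     # dissociation rate, 1/s
c_imager = 10e-9  # 10 nM imager strand

print(mean_bound_time_s(k_off))
print(round(mean_dark_time_s(k_on, c_imager), 1))
```

In this picture, a longer or more stable imager-docking duplex lowers k_off and lengthens the on-time, while raising the imager concentration shortens the off-time without affecting the on-time.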

By  linking  DNA-PAINT  docking  strands  to  antibodies,  we  have 
extended  the  DNA-PAINT  method  to  enable  multiplexed  2D  and 
Figure 1 | DNA-PAINT. (a) A microtubule-like
DNA origami polymer (cylinders represent
DNA double helices) is decorated with
single-stranded extensions (docking strands)
on two opposite faces (red) spaced ~16 nm
apart. Complementary fluorescent imager strands
transiently bind from solution to docking
strands. Biotinylated strands (present on orange
helices) immobilize the structures to glass
surfaces for fluorescence imaging. (b) Transient
binding between imager and docking strands
produces fluorescence blinking, allowing
stochastic super-resolution imaging.
(c) Transmission electron microscopy image of
origami polymers with a measured width of
16 ± 1 nm (mean ± s.d.). (d) DNA-PAINT
super-resolution images obtained using
Cy3b-labeled imager strands (15,000 frames,
5-Hz frame rate). Two distinct lines are visible. (e) Cross-sectional histograms of the boxed areas i and ii in d (arrows denote histogram direction)
both reveal a distance of ~16 nm as designed (FWHM of each distribution is ~7-10 nm). Scale bars, 40 nm.
3D super-resolution imaging of protein components in fixed
cells. We report sub-10-nm lateral imaging resolution in vitro of
synthetic DNA structures without the use of a sophisticated setup
(in contrast with STED3 or dual-objective STORM16) or specialized
experimental conditions such as dye-caging approaches17.
We used the unique programmability of DNA molecules to perform
sequential multiplexing (with Exchange-PAINT) using only
a single fluorescent dye and obtained the first ten-color in vitro
super-resolution image on DNA nanostructures. We also show
the applicability of Exchange-PAINT to cellular imaging by demonstrating
four-color imaging of protein targets in fixed cells and
three-color 3D imaging.

RESULTS 

Sub-10-nm in vitro imaging with DNA-PAINT

First we optimized the spatial resolution of DNA-PAINT relative
to that of earlier DNA-PAINT studies10,13-15 through enhanced
drift correction and higher localization accuracy. We achieved
the latter by collecting more emitted photons per binding event
through optimization of fluorescence on-times and camera
integration time and by increasing laser excitation intensities.
Extracting more photons per binding event was also facilitated by
the fact that DNA-PAINT imaging is not prone to photobleaching,
as imager strands are continuously replenished from solution10.
This replenishing also allows nearly 100% imaging efficiency
of all docking sites10. This imaging efficiency of docking sites
does not necessarily translate to imaging efficiency of targets, as
the labeling efficiency of targets with docking strands may not
be 100%. However, if present and accessible, every docking site
should eventually be imaged during data acquisition10.

We evaluated imaging performance by visualizing DNA origami
structures that mimic in vitro-assembled microtubules (Fig. 1a).
In DNA origami, a long ssDNA molecule (the 'scaffold') is 'folded'
into a desired shape or pattern by the sequence-specific binding
of hundreds of short oligonucleotides (the 'staple' strands) to
designated regions on the scaffold15,19 (Supplementary Fig. 1).
The DNA origami monomer is a tunnel-like structure (with
dimensions of ~16 nm × ~16 nm × ~75 nm). We linked
monomers using connector strands to form a homopolymer
mimicking microtubules (Fig. 1c), which formed with high yield
(Supplementary Fig. 2). To allow for DNA-PAINT imaging, we
extended staple strands on two opposite faces of the structure
by adding single-stranded docking sites at the 3' end. We bound
assembled polymers to a BSA-biotin-streptavidin glass surface
using biotinylated staple strands extruding from the bottom of
the structure10,13,14,20. DNA-PAINT was then performed using
Cy3b-labeled imager strands.

The super-resolved image revealed two adjacent lines
spaced ~16 nm apart, a geometry matching that of the designed
microtubule-like origami (Fig. 1d). The cross-sectional profiles
of two regions of interest possessed well-separated peaks at the
designed distance with a full width at half maximum (FWHM) of
~7-10 nm (Fig. 1e and Supplementary Fig. 3). We note that this
resolution was obtained in standard DNA hybridization buffer
without the use of oxygen scavengers21, triplet-state quenchers22
or redox systems23.
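To see why peaks of this width are well separated at a 16 nm spacing, the cross-sectional profile can be approximated as the sum of two equal Gaussians. This is a modeling assumption of ours, not the analysis used for the figure. The sketch converts FWHM to standard deviation via FWHM = 2·sqrt(2 ln 2)·σ and compares the signal midway between the lines with the signal at a nominal line position.

```python
import math


def fwhm_to_sigma(fwhm_nm):
    """Standard deviation of a Gaussian with the given FWHM."""
    return fwhm_nm / (2.0 * math.sqrt(2.0 * math.log(2.0)))


def two_peak_profile(x_nm, separation_nm, sigma_nm):
    """Sum of two unit-height Gaussians centered at +/- separation/2."""
    half = separation_nm / 2.0
    left = math.exp(-((x_nm + half) ** 2) / (2.0 * sigma_nm ** 2))
    right = math.exp(-((x_nm - half) ** 2) / (2.0 * sigma_nm ** 2))
    return left + right


def dip_ratio(separation_nm, fwhm_nm):
    """Midpoint signal divided by the signal at a nominal peak position."""
    sigma = fwhm_to_sigma(fwhm_nm)
    midpoint = two_peak_profile(0.0, separation_nm, sigma)
    at_peak = two_peak_profile(separation_nm / 2.0, separation_nm, sigma)
    return midpoint / at_peak


# Worst case from the reported range: FWHM of 10 nm, lines 16 nm apart.
print(round(fwhm_to_sigma(10.0), 2))
print(round(dip_ratio(16.0, 10.0), 2))
```

Even at the upper end of the reported FWHM range, the modeled signal between the two lines falls to roughly a third of the peak value, so the lines remain clearly distinguishable.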

Straightforward extension to multicolor imaging was obtained
by coupling spectrally distinct dyes to orthogonal imager strand
sequences as reported previously13 (Supplementary Fig. 4). The
orthogonality of the imager strand sequences prevents cross-talk
between different color channels (Supplementary Fig. 4b-d).

Multiplexed  cellular  imaging  with  DNA-PAINT 

To image cellular components with DNA-PAINT, we specifically
labeled protein targets using antibodies conjugated with
DNA docking strands (Fig. 2). We formed the conjugates by first
reacting biotinylated docking strands with streptavidin and then
incubating with biotinylated antibodies. We first immunostained
fixed HeLa cells using a preassembled antibody-DNA conjugate
against β-tubulin. Atto 655-labeled imager strands were then
introduced, and imaging was performed using HILO microscopy11.
The resulting super-resolution images showed a clear
increase in spatial resolution over that of the diffraction-limited
representation (Fig. 2a-c). A cross-sectional profile yielded a
distance of ~79 nm between two adjacent microtubules with an
apparent width of ~47 and ~44 nm for each (Supplementary
Fig. 5), dimensions consistent with those in earlier reports24.
We observed little to no nonspecific binding of imager strands
to nonlabeled cellular components (see below for quantitative
characterization and discussion of nonspecific binding).
Figure 2 | Spectrally multiplexed DNA-PAINT
super-resolution imaging of microtubules and
mitochondria inside fixed cells. (a) DNA-PAINT
super-resolution image of microtubules inside a
fixed HeLa cell using Atto 655-labeled imager
strands (10,000 frames, 10-Hz frame rate).
Inset: labeling and imaging schematic for
DNA-PAINT in a cellular environment.
Microtubules are labeled with a preassembled
antibody-DNA conjugate, which is formed
between a biotinylated anti-tubulin antibody
and a biotinylated DNA docking strand (a) using
a streptavidin bridge. a* indicates the
complementary imager strand. (b) Zoomed-in
view of the boxed area in a. (c) Diffraction-limited
representation of the area in b. Arrows in b and
c highlight positions where the increase in
resolution of the DNA-PAINT image is clearly
visible. Adjacent microtubules with an apparent
width of ~46 nm at position i are separated by
~79 nm (see also Supplementary Fig. 5).
(d) Dual-color DNA-PAINT super-resolution image
(15,000 frames, 10-Hz frame rate) of microtubules
and mitochondria inside a fixed HeLa cell obtained
using Cy3b-labeled imager strands (a*) for
microtubules (green) and orthogonal Atto 655-labeled
imager strands (b*) for mitochondria (purple).
Inset, labeling and imaging schematic. (e) Zoomed-in
view of the boxed area in d. (f) Diffraction-limited
image of the area in e. Scale bars, 5 μm (a,d) and
1 μm (b,c,e,f).


We achieved multicolor imaging by using orthogonal imager
strands coupled to spectrally distinct dyes. We labeled microtubules
in a fixed HeLa cell with a preassembled antibody-DNA conjugate
carrying a docking sequence for Cy3b-labeled imager strands,
and we stained mitochondria using a second antibody linked to
an orthogonal sequence for Atto 655-labeled imager strands
(Fig. 2d). Although Cy3b- and Atto 655-labeled imager strands
coexisted in solution, imaging was performed sequentially. Images
were drift corrected, and we aligned different color channels using
gold nanoparticles as fiducial markers (Online Methods). The
resulting super-resolution images showed a clear increase in spatial
resolution as compared to the diffraction-limited representation,
and no cross-talk between colors was observed (Fig. 2d-f).

Ten-color  super-resolution  imaging  with  Exchange-PAINT 

As imager strands only transiently bind to the docking strands,
DNA-PAINT allows for a new multiplexing approach wherein
orthogonal imager strands are sequentially applied to the
same sample. This approach, which we call Exchange-PAINT,
is depicted in Figure 3a. Initially, different target species are
labeled with orthogonal docking strands. Once all components
are labeled, the first imager strand species P1* (complementary
to docking strands P1) is introduced and a DNA-PAINT image
is acquired only for the targets labeled with P1. In a subsequent
washing step, imager strands P1* are removed and imager strands
P2* are introduced. Another image for only P2 is then acquired. In
each imaging step, the respective docking sites are super-resolved
and a unique pseudocolor is assigned. Washing and imaging steps
are repeated until all desired targets are imaged. These images are
then aligned and combined to produce the final multicolor image
for the entire sample.
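
The image-wash-image cycle described above can be summarized as a short control loop. The following is a minimal sketch, not the acquisition software used in this work: the sample is modeled as a dictionary from docking-strand name to target coordinates, `acquire` is a hypothetical stand-in for one DNA-PAINT acquisition, and the washing step is represented only by the fact that a single imager species is present in each round.

```python
from typing import Dict, List, Tuple

Point = Tuple[float, float]


def complement(docking: str) -> str:
    """Name of the imager strand complementary to a docking strand (P1 -> P1*)."""
    return docking + "*"


def acquire(sample: Dict[str, List[Point]], imager: str) -> List[Point]:
    """Hypothetical acquisition: only targets whose docking strand is
    complementary to the imager currently in solution are detected."""
    return [pt for docking, pts in sample.items()
            if complement(docking) == imager for pt in pts]


def exchange_paint(sample: Dict[str, List[Point]],
                   palette: List[str]) -> Dict[str, Tuple[str, List[Point]]]:
    """Image each docking species in turn and assign one pseudocolor per round."""
    dockings = sorted(sample)
    if len(palette) < len(dockings):
        raise ValueError("need one pseudocolor per docking species")
    combined = {}
    for round_index, docking in enumerate(dockings):
        imager = complement(docking)          # introduce the next imager species
        image = acquire(sample, imager)       # image only the matching targets
        combined[docking] = (palette[round_index], image)
        # Washing: the imager is removed before the next round, so each
        # round sees exactly one imager species in this model.
    return combined


# Illustrative sample with two orthogonal docking species.
sample = {"P1": [(0.0, 0.0), (1.0, 0.0)], "P2": [(5.0, 5.0)]}
result = exchange_paint(sample, ["red", "green"])
print(result)
```

Because every round uses the same dye in this scheme, the pseudocolor is purely a bookkeeping label assigned after acquisition.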

In contrast with the spectral multiplexing approach described
in the previous section, in Exchange-PAINT, the same dye (and,
hence, the same laser) is used for all the target species. As such,
multiplexing is limited only by the number of possible orthogonal
DNA docking sequences instead of the number of spectrally
distinct dyes (a typical limit for most previous fluorescence
imaging methods).

To demonstrate ten-color super-resolution imaging of DNA
structures using Exchange-PAINT, we designed ten unique
rectangular DNA origami shapes13, each displaying a distinct pattern
of orthogonal docking strands that resembles a digit between 0
and 9 (Fig. 3b). After surface immobilization of all ten structures,
we performed sequential imaging using a custom-made fluidic
chamber (Supplementary Fig. 6a) for easy liquid handling. Ten
orthogonal imager strands (P1* to P10*), all labeled with Cy3b,
were used to perform Exchange-PAINT. The resulting digits from
all ten imaging rounds are shown in Figure 3c. Each target was
resolved with high spatial resolution. Cross-sectional histograms
along the bars of the digits showed sub-10-nm FWHM of the
distributions (data not shown). Note that high resolution was
maintained for all digits, as the same optimized dye (Cy3b) and
imaging conditions were used in each cycle.

A combined image of all ten rounds (Fig. 3d and Supplementary
Fig. 7) demonstrates specific interaction of imager strands with
respective targets with no observable cross-talk between cycles.
An apparent green digit 5 instead of 2 was observed (Fig. 3d). This
is likely not a falsely imaged digit 5 from cross-talk but rather a
mirrored digit 2 (Supplementary Fig. 8). A mirrored image likely
results from an origami immobilized upside down25, with docking
strands trapped underneath yet still accessible to imager strands.

The fluidic setup is designed to minimize sample movement
by decoupling the fluid reservoir and syringe from the actual
flow chamber via flexible tubing. To avoid sample distortion,
we took special care to ensure gentle fluid flow during washing
steps. To verify that the sample indeed exhibited little movement
and little to no distortion, we performed a ten-round Exchange-PAINT
experiment. We imaged the DNA origami for digit 4 in the
first round and reimaged it after ten rounds of buffer exchange.
The  total  sample  movement  (physical  movement  of  the  fluidic 
Figure 3 | Exchange-PAINT. (a) Exchange-PAINT schematic showing sequential imaging of multiple targets using imager strands labeled with the same
fluorophore. (b) Schematic of a DNA origami pattern (70 nm × 100 nm) displaying docking strands that resemble the digit "4." (c) Pseudocolor images
of ten different origami structures displaying digits 0-9 in one sample with high resolution (FWHM of bar-like features <10 nm) and specificity. The
images were obtained using only one fluorophore (Cy3b) through ten imaging-washing cycles (imaging: 7,500 frames per cycle, 5-Hz frame rate;
washing: 1-2 min per cycle). (d) Combined overview image of all ten Exchange-PAINT cycles, demonstrating specific interaction with the respective
target with no cross-talk between imaging cycles. (e) Four-color image of digits 0-3 that are all present on the same DNA origami (10,000 frames each,
5-Hz frame rate; schematic at the bottom). Scale bars, 25 nm (c,e) and 250 nm (d).


chamber with respect to the objective) was less than 2 μm, which
could easily be corrected using fiducial markers. Normalized
cross-correlation analysis for select structures produced a correlation
coefficient of 0.92 (Supplementary Fig. 9), a result indicative
of almost no sample distortion (see also the Discussion below).
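
Correcting a rigid sample shift with fiducial markers amounts to estimating the displacement of the markers between rounds and subtracting it from all localizations. The sketch below is one simple way to do this, assuming a pure translation (no rotation or stretching) and that the same fiducials are identified in both rounds; it is an illustration, not the alignment routine used for the data in this work, and the coordinates are invented.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def centroid(points: List[Point]) -> Point:
    """Mean position of a set of points."""
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)


def align_round(localizations: List[Point],
                fiducials_ref: List[Point],
                fiducials_round: List[Point]) -> List[Point]:
    """Shift one round's localizations so its fiducials overlay the reference.

    Assumes a pure translation between rounds.
    """
    ref_x, ref_y = centroid(fiducials_ref)
    cur_x, cur_y = centroid(fiducials_round)
    dx, dy = ref_x - cur_x, ref_y - cur_y
    return [(x + dx, y + dy) for x, y in localizations]


# Invented example (units: nm): the chamber moved by (+1500, -800) nm
# between the reference round and a later round.
shift = (1500.0, -800.0)
fiducials_ref = [(0.0, 0.0), (4000.0, 0.0), (0.0, 3000.0)]
targets_ref = [(1000.0, 1000.0), (1016.0, 1000.0)]
fiducials_later = [(x + shift[0], y + shift[1]) for x, y in fiducials_ref]
targets_later = [(x + shift[0], y + shift[1]) for x, y in targets_ref]

aligned = align_round(targets_later, fiducials_ref, fiducials_later)
print(aligned)
```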

Finally, using Exchange-PAINT, we successfully imaged four
different digit patterns on the same DNA origami structure (Fig. 3e
and Supplementary Fig. 10). Thus, Exchange-PAINT is not limited
to spatially separate species and can resolve subdiffraction
patterns on the same structure with no observable cross-talk or
sample distortion. Aligning images from different Exchange-PAINT
rounds is straightforward using DNA origami-based drift
markers. Additionally, because imaging is performed using the
same dye, no chromatic aberration needs to be corrected between
imaging rounds.

Multiplexed  cellular  imaging  with  Exchange-PAINT 

We next demonstrated multiplexed in situ imaging in a fixed
HeLa cell with Exchange-PAINT. Figure 4a shows four-color
super-resolution images obtained by sequential imaging using
only a single dye (Atto 655). Using custom-made fluidic chambers
(Supplementary Fig. 6b), we obtained super-resolution images
of β-tubulin in microtubules, COX IV in mitochondria, TGN46
in the Golgi complex and PMP70 in peroxisomes. Imaging and
washing were performed in a similar fashion as for DNA structures.
Again, we saw little to no nonspecific binding of the imager
strands to nonlabeled components (Supplementary Fig. 11).

To quantify possible nonspecific interactions of the imager
strands with cellular components, we performed Exchange-PAINT
experiments in which no DNA-PAINT docking strands were
present on the antibody-streptavidin conjugates, but otherwise
we performed the labeling and imaging process as described
above (Supplementary Fig. 12). We observed minimal nonspecific
interaction of the imager strands (Atto 655 or Cy3b) with the
cellular components including genomic DNA. Nonspecific
interactions can be excluded by analyzing their blinking behavior
in an intensity-versus-time trace13: such interactions lead to
nonrepeating localization events and therefore show an easily
identifiable blinking signature (a non-exponential distribution
of on- and off-times) that differs from that of specific DNA
hybridization interactions (Supplementary Fig. 12).

To quantify potential sample distortion in cellular Exchange-PAINT
imaging, we performed a study similar to that for the
in vitro case. Here we imaged microtubules and mitochondria in
four rounds of Exchange-PAINT by imaging microtubules first
and then mitochondria and then repeating the process. We then
selected a region of interest in each image and performed a
normalized cross-correlation analysis. We obtained cross-correlation
coefficients of 0.80 and 0.96 for the microtubule and mitochondria
images, respectively (Supplementary Fig. 13), values indicative
of minimal sample distortion.

We note that, even in a super-resolution image without liquid
exchange, one cannot expect 100% correlation between two
consecutive images of the same region in a sample owing to the
stochastic nature of the image formation. To see this effect, one
can simply split a super-resolution raw data set into two parts
of equal length, perform a stochastic reconstruction and calculate
the normalized cross-correlation coefficient for these two
supposedly equal images. We performed this analysis for a
subregion of the microtubule image in Figure 4b and obtained a
coefficient of 0.88, similar to the Exchange-PAINT case of 0.80
(Supplementary Fig. 13).
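
The split-half argument above can be reproduced on synthetic data. The sketch below is a toy model under stated assumptions: the normalized cross-correlation coefficient is taken to be the zero-shift, mean-subtracted form (the exact normalization used for the figures is not reproduced here), the "image" is a one-dimensional histogram across two simulated filaments, and all numerical parameters are invented for illustration.

```python
import math
import random
from typing import List, Sequence


def ncc(a: Sequence[float], b: Sequence[float]) -> float:
    """Zero-shift, mean-subtracted normalized cross-correlation coefficient."""
    if len(a) != len(b):
        raise ValueError("images must have the same number of pixels")
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(sum((x - mean_a) ** 2 for x in a) *
                    sum((y - mean_b) ** 2 for y in b))
    if den == 0.0:
        raise ValueError("constant image: coefficient undefined")
    return num / den


def render(localizations: Sequence[float], n_bins: int, bin_nm: float) -> List[int]:
    """Reconstruct a 1D 'image' by histogramming localization positions."""
    hist = [0] * n_bins
    for x in localizations:
        i = int(x // bin_nm)
        if 0 <= i < n_bins:
            hist[i] += 1
    return hist


rng = random.Random(0)
# Two filaments 80 nm apart, 5 nm localization precision (invented values).
locs = [rng.gauss(rng.choice((100.0, 180.0)), 5.0) for _ in range(2000)]

# Split the raw data into two halves of equal length and reconstruct each.
img1 = render(locs[:1000], n_bins=60, bin_nm=5.0)
img2 = render(locs[1000:], n_bins=60, bin_nm=5.0)

split_half = ncc(img1, img2)
print(f"split-half coefficient: {split_half:.3f}")
```

Even though both halves sample the same underlying structure, the coefficient stays below 1 because each half is a finite, random draw of localizations.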

Finally, we used optical astigmatism imaging26,27 to demonstrate
3D Exchange-PAINT super-resolution imaging in a fixed HeLa
cell. We labeled microtubules, mitochondria and peroxisomes
and obtained three-color 3D super-resolution Exchange-PAINT
Figure 4 | Multiplexed 2D and 3D Exchange-PAINT super-resolution imaging in fixed cells. Each target is labeled with an antibody carrying a unique
DNA-PAINT docking sequence. 2D and 3D imaging are performed using imager strands labeled with Atto 655 and Cy3b, respectively. (a, i) Only imager
strands a* are present and microtubules are specifically imaged. (a, ii) After imager strands a* are washed out, imager strands b* are introduced and
COX IV proteins in mitochondria are imaged. TGN46 proteins in the Golgi complex (a, iii) and PMP70 proteins in peroxisomes (a, iv) are imaged in the
same way (imaging: 15,000 frames per cycle, 10-Hz imaging rate; washing: 1-2 min per cycle). (a, v) Overlay of all four targets. (b-g) Three-color 3D
Exchange-PAINT images using Cy3b-labeled imager strands. (b) 3D image of microtubules; color indicates height on a scale from 0 nm (green) to
800 nm (purple). (c) 3D image of mitochondria; the height scale is 40-1,100 nm. (d) 3D image of peroxisomes; the height scale is 0-700 nm.
(e) Zoomed-in view of the boxed area in b; the height scale is 0-700 nm. (f) xz profile of the boxed area in e. (g) A two-component Gaussian fit
reveals a distance of ~109 nm in z of two adjacent microtubules. For 3D imaging: 50,000 frames per cycle, 16-Hz imaging rate; washing: 1-2 min
per cycle. Scale bars: 5 μm (a-d), 500 nm (e) and 100 nm (f).


images using Cy3b (Fig. 4b-d and Supplementary Fig. 14).
A magnified image revealed separate microtubules crossing each
other in different z planes (Fig. 4e,f). A cross-sectional histogram
in z yielded a distance of ~109 nm, which is well below the
diffraction limit (Fig. 4g).

DISCUSSION 

Sequential multiplexing, as that achieved with Exchange-PAINT,
provides a complementary approach to previous spectral and
geometric multiplexing13,23. Compared to geometric
multiplexing13,28, Exchange-PAINT uses a more compact labeling entity (a
~9-nt DNA strand), but this should allow up to several hundred
noninteracting sequences within tight bounds for dissociation
kinetics. Exchange-PAINT may enable a substantial increase
in multiplexing for both super-resolution and diffraction-limited
imaging. In contrast with spectral multiplexing23,29,
Exchange-PAINT uses only a single fluorophore, thereby allowing
the selection of an optimal dye with respect to its photophysical
properties (such as the number of emittable photons) for
super-resolution imaging. Unlike previous multiplexing approaches
for diffraction-limited imaging based on DNA strand exchange
reactions30 and in situ sequencing31, Exchange-PAINT does
not involve labeling and erasing steps, as imager strands bind to
docking strands only transiently. Exchange-PAINT thus permits
simpler experimental procedures and faster image acquisition
(~1-2 min of washing between imaging cycles) while preserving
the intrinsic super-resolution capability of DNA-PAINT.

Fully translating the in vitro imaging capability of DNA-PAINT
and Exchange-PAINT to in situ imaging of cellular components
will benefit from strategies to label diverse cellular targets with
high specificity and efficiency. One way to facilitate this goal, as
noted by other researchers17,24,32, is reducing the label size. Our
current antibody-DNA conjugation method based on streptavidin
bridging, though providing a simple and modular approach,
results in a rather bulky conjugate with large linker distances that
preclude obtaining the sub-10-nm resolution we demonstrated
by direct labeling of DNA nanostructures with imager strands.
Thus a next logical step would be direct coupling of primary
antibodies to DNA strands33 without a streptavidin intermediate.
Additionally, to further decrease the label size, it would be
advantageous to use nanobody- or aptamer-based labeling
strategies24,32, with the latter serving as a natural extension to
DNA-PAINT imaging as it already uses nucleic acid-based interactions.
In addition to its use in protein imaging, our method should be
directly applicable to DNA and RNA imaging in fixed cells; for
example, by using FISH methods28,34.

DNA-PAINT and Exchange-PAINT provide a simple and robust
method for highly multiplexed super-resolution imaging. With
further development, we anticipate it will become a useful and
standard tool for studying complex biomolecular systems.

METHODS 

Methods  and  any  associated  references  are  available  in  the  online 
version  of  the  paper. 

Note:  Any  Supplementary  Information  and  Source  Data  files  are  available  in  the 
online version of the paper.
ONLINE METHODS

Materials. Unmodified DNA oligonucleotides were purchased
from Integrated DNA Technologies. Fluorescently modified DNA
oligonucleotides were purchased from Biosynthesis. Biotinylated
monoclonal antibodies against β-tubulin (9F3; catalog number:
6181) and COX IV (3E11; catalog number: 6014) were purchased
from Cell Signaling. Anti-PMP70 (catalog number: ab28499) was
purchased from Abcam. Anti-TGN46 (catalog number:
NBP1-49643B) was purchased from VWR. Streptavidin was purchased
from Invitrogen (catalog number: S-888). Bovine serum albumin
(BSA) and BSA-biotin were obtained from Sigma-Aldrich (catalog
number: A8549). Glass slides and coverslips were purchased
from VWR. Lab-Tek II chambered coverglass was purchased from
Thermo Fisher Scientific. M13mp18 scaffold was obtained from
New England BioLabs. p8064 scaffold for microtubule-like DNA
origami structures was prepared as described before19. Freeze 'N
Squeeze columns were ordered from Bio-Rad. TetraSpeck Beads
were purchased from Life Technologies. Paraformaldehyde,
glutaraldehyde and TEM grids (FORMVAR 400 mesh copper grids)
were obtained from Electron Microscopy Sciences.

Three buffers were used for sample preparation and imaging:
buffer A (10 mM Tris-HCl, 100 mM NaCl, 0.05% Tween 20,
pH 7.5), buffer B (5 mM Tris-HCl, 10 mM MgCl2, 1 mM
EDTA, 0.05% Tween 20, pH 8) and buffer C (1× PBS, 500 mM
NaCl, pH 8).

Optical setup. Fluorescence imaging was carried out on an
inverted Nikon Eclipse Ti microscope (Nikon Instruments) with
the Perfect Focus System, applying an objective-type TIRF
configuration using a Nikon TIRF illuminator with an oil-immersion
objective (CFI Apo TIRF 100×, numerical aperture (NA) 1.49,
oil). For 2D imaging an additional 1.5× magnification was used
to obtain a final magnification of ~150-fold, corresponding to
a pixel size of 107 nm. Three lasers were used for excitation:
488 nm (200 mW nominal, Coherent Sapphire), 561 nm (200 mW
nominal, Coherent Sapphire) and 647 nm (300 mW nominal,
MBP Communications). The laser beam was passed through
cleanup filters (ZT488/10, ZET561/10 and ZET640/20, Chroma
Technology) and coupled into the microscope objective using
a multiband beam splitter (ZT488rdc/ZT561rdc/ZT640rdc,
Chroma Technology). Fluorescence light was spectrally filtered
with emission filters (ET525/50m, ET600/50m and ET700/75m,
Chroma Technology) and imaged on an electron-multiplying
charge-coupled device (EMCCD) camera (iXon X3 DU-897,
Andor Technologies).
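
The pixel sizes quoted in these Methods follow from dividing the physical camera pixel by the total optical magnification. The sketch below works through that arithmetic. The 16-μm physical pixel size is an assumption based on the published specification of this camera model and is not stated in the text; the function name is illustrative.

```python
def image_pixel_size_nm(camera_pixel_um: float,
                        objective_mag: float,
                        extra_mag: float = 1.0) -> float:
    """Size of one camera pixel projected back into the sample plane, in nm."""
    return camera_pixel_um * 1000.0 / (objective_mag * extra_mag)


# Assumption (not given in the text): 16 um physical pixels on the EMCCD sensor.
CAMERA_PIXEL_UM = 16.0

# 2D imaging: 100x objective with the additional 1.5x magnification.
size_2d = image_pixel_size_nm(CAMERA_PIXEL_UM, 100.0, 1.5)

# Imaging without additional magnification: 100x objective only.
size_no_extra = image_pixel_size_nm(CAMERA_PIXEL_UM, 100.0)

print(f"with 1.5x: {size_2d:.1f} nm; without: {size_no_extra:.1f} nm")
```

Under this assumption the two results round to the 107-nm and 160-nm pixel sizes stated for 2D and 3D imaging, respectively.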

DNA origami self-assembly. The microtubule-like DNA origami
structures were formed in a one-pot reaction with a 40-μl total
volume containing 10 nM scaffold strand (p8064), 500 nM folding
staples and biotin handles, 750 nM biotin anti-handles and 1.1 μM
DNA-PAINT docking strands in folding buffer (1× TAE buffer
with 20 mM MgCl2). The solution was annealed using a thermal
ramp13 cooling from 80 °C to 14 °C over the course of 15 h. After
self-assembly, monomeric structures were purified by agarose
gel electrophoresis (1.5% agarose, 0.5× TBE, 10 mM MgCl2, 1×
SybrSafe) at 4.5 V/cm for 1.5 h (see Supplementary Fig. 2). Gel
bands were cut, crushed and filled into a Freeze 'N Squeeze column
and spun for 5 min at 1,000g at 4 °C. Polymerization was carried
out at 30 °C for 48 h with a fivefold excess of polymerization
staples in folding buffer. Polymerized structures were used for
imaging without further purification.

DNA origami drift markers were self-assembled in a one-pot
reaction (40-μl total volume, 20 nM M13mp18 scaffold, 100 nM
biotinylated staples, 530 nM staples with DNA-PAINT docking
sites, 1× TAE with 12.5 mM MgCl2). Self-assembled structures
were purified as described before.

DNA origami structures for the four-color in vitro Exchange-PAINT
demonstration were self-assembled in a one-pot reaction
(40-μl total volume, 30 nM M13mp18 scaffold, 470 nM biotinylated
staples, 400 nM staples with docking sites for number imaging,
370 nM core structure staples, 1× TAE with 12.5 mM MgCl2).
Self-assembled structures were purified as described before.

DNA origami structures for the ten-color in vitro Exchange-PAINT
demonstration were self-assembled in a one-pot reaction
(40-μl total volume, 30 nM M13mp18 scaffold, 36 nM biotinylated
staples, 750 nM staples with docking sites for number imaging,
300 nM core structure staples, 1× TAE with 12.5 mM MgCl2).
Structures were not purified. Excessive staples were washed out of
the sample after immobilization of the structure on the surface.

DNA strand sequences for the microtubule-like DNA origami
structures can be found in Supplementary Table 1. DNA
strand sequences for DNA origami drift markers can be found in
Supplementary Table 2. DNA strand sequences for DNA origami
structures for the ten-color in vitro Exchange-PAINT demonstration
can be found in Supplementary Tables 3 and 4 for odd and
even digits, respectively. DNA strand sequences for DNA origami
structures for the in vitro Exchange-PAINT demonstration (digits 0-3)
can be found in Supplementary Table 5. The scaffold sequences
for p8064 and M13mp18 can be found in Supplementary Tables 6
and 7, respectively.

DNA-PAINT  imager  and  docking  sequences  as  well  as  sequences 
for  surface  attachment  via  biotin  are  listed  in  Supplementary 
Table  8. 

Antibody-DNA conjugates. Antibody-DNA conjugates used to
specifically label proteins of interest with DNA-PAINT docking
sites were preassembled in two steps. First, 3.2 μl of 1 mg/ml
streptavidin (dissolved in buffer A) was reacted with 0.5 μl
biotinylated DNA-PAINT docking strands at 100 μM and an additional
5.3 μl of buffer A for 30 min at room temperature (RT) while
gently shaking. The solution was then incubated in a second step with
1 μl of monoclonal biotinylated antibodies at 1 mg/ml against
the protein of interest for 30 min at RT. Filter columns (Amicon
100 kDa, Millipore) were used to purify the preassembled
conjugates from unreacted streptavidin-oligo conjugates.
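
To make the stoichiometry of this two-step assembly easier to see, the volumes and concentrations above can be converted to molar amounts. The sketch below does this arithmetic under two assumptions that are not stated in the text: a molecular weight of about 53 kDa for tetrameric streptavidin and about 150 kDa for an IgG antibody. With those values the docking strand and streptavidin are present in comparable amounts (roughly 50 and 60 pmol), and streptavidin is in roughly ninefold molar excess over the antibody.

```python
def pmol_from_mass(volume_ul: float, conc_mg_per_ml: float, mw_kda: float) -> float:
    """Amount in pmol from a volume (ul), mass concentration (mg/ml) and MW (kDa)."""
    mass_ug = volume_ul * conc_mg_per_ml      # ul x mg/ml = ug
    return mass_ug / mw_kda * 1000.0          # ug / kDa = nmol; x1000 -> pmol


def pmol_from_molar(volume_ul: float, conc_um: float) -> float:
    """Amount in pmol from a volume (ul) and molar concentration (uM)."""
    return volume_ul * conc_um                # ul x uM = pmol


# Assumed molecular weights (not given in the text).
MW_STREPTAVIDIN_KDA = 53.0
MW_IGG_KDA = 150.0

streptavidin = pmol_from_mass(3.2, 1.0, MW_STREPTAVIDIN_KDA)
docking = pmol_from_molar(0.5, 100.0)
antibody = pmol_from_mass(1.0, 1.0, MW_IGG_KDA)
step1_volume_ul = 3.2 + 0.5 + 5.3

print(f"streptavidin: {streptavidin:.1f} pmol")
print(f"docking strand: {docking:.1f} pmol")
print(f"antibody: {antibody:.2f} pmol")
print(f"step-1 volume: {step1_volume_ul:.1f} ul")
```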

Cell immunostaining. HeLa and DLD1 cells were cultured
with Eagle's minimum essential medium fortified with 10% FBS
with penicillin and streptomycin and were incubated at 37 °C
with 5% CO2. At approximately 30% confluence, cells were
seeded into Lab-Tek II chambered coverglass 24 h before fixation.
Microtubules, mitochondria, Golgi complexes and peroxisomes
were immunostained using the following procedure: washing in
PBS; fixation in a mixture of 3% paraformaldehyde and 0.1%
glutaraldehyde in PBS for 10 min; 3× washing with PBS; reduction
with ~1 mg/ml NaBH4 for 7 min; 3× washing with PBS;
permeabilization with 0.25% (v/v) Triton X-100 in PBS for 10 min;
3× washing with PBS; blocking with 3% (w/v) BSA for 30 min
and staining overnight with the preassembled antibody-DNA
conjugates against β-tubulin, COX IV, PMP70 or TGN46
(conjugates were diluted to 10 μg/ml in 5% BSA); 3× washing with
PBS; postfixation in a mixture of 3% paraformaldehyde and 0.1%
glutaraldehyde in PBS for 10 min; and 3× washing with PBS.

Super-resolution DNA-PAINT imaging of microtubule-like
DNA origami structures. For sample preparation, a piece of
coverslip (no. 1.5, 18 × 18 mm2, ~0.17 mm thick) and a glass
slide (3 × 1 inch2, 1 mm thick) were sandwiched together by two
strips of double-sided tape to form a flow chamber with inner
volume of ~20 μl. First, 20 μl of biotin-labeled bovine albumin
(1 mg/ml, dissolved in buffer A) was flowed into the chamber and
incubated for 2 min. The chamber was then washed using 40 μl
of buffer A. 20 μl of streptavidin (0.5 mg/ml, dissolved in
buffer A) was then flowed through the chamber and allowed
to bind for 2 min. After washing with 40 μl of buffer A and
subsequently with 40 μl of buffer B, 20 μl of biotin-labeled
microtubule-like DNA structures (~300 pM monomer
concentration) and DNA origami drift markers (~100 pM) in buffer B
were finally flowed into the chamber and incubated for 5 min. The
chamber was washed using 40 μl of buffer B.

The final imaging buffer solution contained 1.5 nM Cy3b-labeled
imager strands in buffer B. The chamber was sealed with
epoxy before subsequent imaging. The CCD readout bandwidth
was set to 1 MHz at 16 bit and 5.1 pre-amp gain. No
electron-multiplying (EM) gain was used. Imaging was performed using
TIR illumination with an excitation intensity of 294 W/cm2
at 561 nm.


used 600 pM Cy3b-labeled imager strands and 1.5 nM Atto 655-labeled
imager strands in buffer C.

Super-resolution Exchange-PAINT imaging of cells. A Lab-Tek II
chamber was adapted for fluid exchange as shown in
Supplementary Figure 6b. 2D images (Fig. 4a, i-iv) were
acquired with an EMCCD readout bandwidth of 5 MHz at 14 bit,
5.1 pre-amp gain and 255 EM gain. 3D images (Fig. 4b-d) were
acquired with a CCD readout bandwidth of 3 MHz at 14 bit,
5.1 pre-amp gain and no EM gain. Imaging was performed using
HILO illumination in both cases. Sequential imaging was done
as described for the 2D origami nanostructures, but the washing
steps were performed using buffer C. The laser power densities
at 647 nm were 257 W/cm2 in Figure 4a, i and 385 W/cm2
in Figure 4a, ii-iv. The laser power densities at 561 nm were
31 W/cm2 in Figure 4b-d.

Imaging conditions were as follows. For Figure 4a, i we used
700 pM Atto 655-labeled imager strands in buffer C. For
Figure 4a, ii-iv we used 2 nM Atto 655-labeled imager strands
in buffer C. For Figure 4b we used 800 pM Cy3b-labeled imager
strands in buffer C. For Figure 4c,d we used 2 nM Cy3b-labeled
imager strands in buffer C.

3D DNA-PAINT imaging. 3D images were acquired with a
cylindrical lens in the detection path (Nikon). The N-STORM analysis
package for NIS Elements (Nikon) was used for data processing.
Imaging was performed without additional magnification in the
detection path, yielding 160-nm pixel size. 3D calibration was
carried out according to the manufacturer's instructions.


Super-resolution Exchange-PAINT imaging of DNA nanostructures.
For fluid exchange, a custom flow chamber was constructed
as shown in Supplementary Figure 6a. A detailed preparation
protocol can be found in the Supplementary Protocol. Prior to
the functionalizing of the imaging chamber with BSA-biotin, it
was rinsed with 1 M KOH for cleaning. Binding of the origami
structures to the surface of the flow chamber was performed as
described before. Each image acquisition step was followed by
a brief ~1-2 min washing step consisting of at least three washes
using 200 μl of buffer B for each. Then the next imager strand
solution was introduced. The surface was monitored throughout
the washing procedure to ensure complete exchange of imager
solutions. Acquisition and washing steps were repeated until all
ten targets were imaged. The CCD readout bandwidth was set
to 3 MHz at 14 bit and 5.1 pre-amp gain. No EM gain was used.
Imaging was performed using TIR illumination with an excitation
intensity of 166 W/cm2 at 561 nm (ten-color Exchange-PAINT
with 3 nM Cy3b-labeled imager strands in buffer B; Fig. 3c,d) and
600 W/cm2 at 647 nm (four-color Exchange-PAINT with 3 nM
Atto 655-labeled imager strands in buffer B; Fig. 3e).

Super-resolution DNA-PAINT imaging of cells. All data were
acquired with an EMCCD readout bandwidth of 5 MHz at 14 bit,
5.1 pre-amp gain and 255 EM gain. Imaging was performed using
HILO illumination11. The laser power densities were 283 W/cm2
at 647 nm in Figure 2a and 142 W/cm2 at 647 nm and 19 W/cm2
at 561 nm in Figure 2d.

Imaging conditions were as follows. For Figure 2a we used 700 pM
Atto 655-labeled imager strands in buffer C. For Figure 2d we

Imager strand concentration determination. Optimal imager
concentrations were determined empirically according to the
labeling density. Generally, a sufficiently high fluorescence off-on
ratio must be ensured so that only a single imager strand is
bound per diffraction-limited area. Additionally, a
sufficient imager strand concentration (and thus a sufficiently short
fluorescence off-time) is necessary to ensure enough binding
events, and thereby robust detection of every docking strand,
during image acquisition.

Super-resolution data processing. Super-resolution DNA-PAINT
images were reconstructed using spot-finding and
2D-Gaussian fitting algorithms programmed in LabVIEW10
(Supplementary Software). A simplified version of this software
is available for download at http://www.dna-paint.net/ or
http://molecular-systems.net/software/.

Normalized cross-correlation analysis. Normalized cross-correlation
coefficients were obtained by first normalizing the
respective reconstructed grayscale super-resolution images and
subsequently performing a cross-correlation analysis in Matlab
R2013b (MathWorks).
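
The normalization step is not specified further in the text. As a minimal sketch, assuming that "normalizing" means subtracting each image's mean and scaling by its root-sum-of-squares, and that the coefficient is evaluated at zero lag, the calculation could look like the following. The function name and the nested-list image format are illustrative choices, not the authors' Matlab code.

```python
import math


def normalized_cross_correlation(image_a, image_b):
    """Zero-lag normalized cross-correlation of two grayscale images.

    Images are nested lists (rows of pixel values) of identical shape.
    Assumption: normalization = mean subtraction followed by division
    by the root-sum-of-squares of each image.
    """
    if len(image_a) != len(image_b) or any(
        len(row_a) != len(row_b) for row_a, row_b in zip(image_a, image_b)
    ):
        raise ValueError("images must have the same shape")
    a = [float(v) for row in image_a for v in row]
    b = [float(v) for row in image_b for v in row]
    if not a:
        raise ValueError("images must not be empty")
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    da = [v - mean_a for v in a]
    db = [v - mean_b for v in b]
    denom = math.sqrt(sum(x * x for x in da) * sum(y * y for y in db))
    if denom == 0.0:
        raise ValueError("an image with zero variance cannot be normalized")
    # The result lies in [-1, 1]; identical images give 1.
    return sum(x * y for x, y in zip(da, db)) / denom
```

With this definition, a reconstructed image compared with itself scores 1, and an image compared with its intensity-inverted copy scores -1.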

Drift correction and channel alignment. DNA origami structures
(Supplementary Fig. 1) were used for drift correction and
as alignment markers in in vitro DNA-PAINT and Exchange-PAINT
imaging. Drift correction was performed by tracking the
position of each origami drift marker throughout the duration
of each image acquisition. The trajectories of all detected drift
markers were then averaged and used to globally correct the
drift in the final super-resolution reconstruction. For channel
alignment between different imaging cycles in Exchange-PAINT,
these structures were used as alignment points by matching their
positions in each Exchange-PAINT image.
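
The marker-averaging procedure described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' LabVIEW implementation: each marker is assumed to be localized in every frame, drift is measured relative to the first frame, and the names `average_drift` and `correct_localizations` are hypothetical.

```python
def average_drift(marker_tracks):
    """Average the displacement of all drift markers frame by frame.

    marker_tracks: list of tracks; each track is a list of (x, y)
    positions, one per frame, and all tracks have the same length.
    Returns one (dx, dy) per frame, relative to the first frame.
    """
    if not marker_tracks:
        raise ValueError("at least one marker track is required")
    n_frames = len(marker_tracks[0])
    if any(len(track) != n_frames for track in marker_tracks):
        raise ValueError("all marker tracks must cover the same frames")
    n_markers = len(marker_tracks)
    drift = []
    for frame in range(n_frames):
        dx = sum(t[frame][0] - t[0][0] for t in marker_tracks) / n_markers
        dy = sum(t[frame][1] - t[0][1] for t in marker_tracks) / n_markers
        drift.append((dx, dy))
    return drift


def correct_localizations(localizations, drift):
    """Subtract the averaged drift from every (frame, x, y) localization."""
    return [(f, x - drift[f][0], y - drift[f][1]) for f, x, y in localizations]
```

Averaging over several markers reduces the influence of the localization noise of any single marker on the global correction.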

For cellular imaging, 100-nm-diameter gold nanoparticles
(Sigma-Aldrich; 10 nM in buffer C, added before imaging) were
used as drift and alignment markers. The gold nanoparticles
adsorb nonspecifically to the glass bottom of the imaging
chambers. Drift correction and alignment were performed in a
fashion similar to that for the origami drift markers. Again, the
apparent movement of all gold nanoparticles in a field of view
was tracked throughout the image acquisition. The obtained
trajectories were then averaged and used for global drift correction
of the final super-resolution image. For the dual-color
image of mitochondria and microtubules in Figure 2d-f, the
gold particles were visible in both color channels. The same gold
nanoparticles were also used for drift correction and realignment
of the different imaging rounds in the in situ Exchange-PAINT
experiments (Fig. 4).

Transmission electron microscopy (TEM) imaging. For imaging,
3.5 µl of undiluted microtubule-like DNA structures were
adsorbed for 2 min onto glow-discharged, carbon-coated TEM
grids. The grids were then stained for 10 s using a 2% aqueous
ultrafiltrated (0.2-µm filter) uranyl formate solution containing
25 mM NaOH. Imaging was performed using a JEOL JEM-1400
operated at 80 kV.

Atomic force microscopy imaging. Imaging was performed
using tapping mode on a Multimode VIII atomic force microscope
(AFM) with an E-scanner (Bruker). Imaging was performed
in TAE/Mg2+ buffer solution with DNP-S oxide-sharpened silicon
nitride cantilevers and SNL sharp nitride levers (Bruker Probes)
using resonance frequencies between 7 and 9 kHz of the narrow
100-µm, 0.38-N/m-force-constant cantilever. After self-assembly
of the origami structure, ~20 µl of TAE/Mg2+ buffer solution
was deposited onto a freshly cleaved mica surface (Ted Pella)
glued to a metal puck (Ted Pella). After 30 s the mica surface
was dried using a gentle stream of N2, and 5 µl of the origami
solution was deposited onto the mica surface. After another 30 s,
30 µl of additional buffer solution was added to the sample.
Imaging parameters were optimized for best image quality while
the highest possible set point was maintained to minimize damage
to the samples. Images were postprocessed by subtracting a
first-order polynomial from each scan line. Drive amplitudes were
approximately 0.11 V, integral gains were ~2 and proportional
gains were ~4.
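
The line-by-line postprocessing mentioned above (subtracting a first-order polynomial from each scan line) can be sketched as an ordinary least-squares line fit per row. The code below is a hedged illustration, not the vendor software's routine; pixel index is assumed as the x coordinate, and the function names are hypothetical.

```python
def flatten_scan_line(heights):
    """Subtract the least-squares straight line from one AFM scan line."""
    n = len(heights)
    if n < 2:
        # A line fit is undefined for fewer than two points.
        return [0.0 for _ in heights]
    mean_x = (n - 1) / 2.0
    mean_y = sum(heights) / n
    sxx = sum((x - mean_x) ** 2 for x in range(n))
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(heights))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return [y - (slope * x + intercept) for x, y in enumerate(heights)]


def flatten_image(image):
    """Apply the first-order flattening independently to every scan line."""
    return [flatten_scan_line(row) for row in image]
```

Because each row is fitted independently, this removes per-line tilt and offset, at the cost of slightly depressing the background around tall features on the same line.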

ABSTRACT: Nucleic acids have emerged as effective
materials for assembling complex nanoscale structures. To
tailor the structures to function optimally for particular
applications, a broad structural design space is desired. Despite
the many discrete and extended structures demonstrated in the
past few decades, the design space remains to be fully explored.
In particular, the complex finite-sized structures produced to
date have been typically based on a small number of structural
motifs. Here, we perform a comprehensive study of the design
space for complex DNA structures, using more than 30 distinct
motifs derived from single-stranded tiles. These motifs self-assemble
to form structures with diverse strand weaving patterns and specific
geometric properties, such as curvature and twist. We performed a
systematic study to control and characterize the curvature of the
structures, and constructed a flat structure with a corrugated
strand pattern. The work here reveals the broadness of the design
space for complex DNA nanostructures.


■  INTRODUCTION

Self-assembly of nucleic acids (DNA and RNA) provides a
powerful approach for constructing sophisticated synthetic
molecular structures and devices. By encoding sequence
complementarity into component DNA strands, prescribed
structures can be assembled under the appropriate formation
conditions.1 After three decades of development, the complexity
of synthetic DNA structures has grown from simple
branched junctions1 formed from just a few strands to complex
2D and 3D objects composed of hundreds or even thousands
of distinct strands.2-22 Moreover, researchers have demonstrated
the construction of dynamic systems, including
switches/3 walkers/4-26 circuits/' *27"28 and triggered assembly
systems.25 Additionally, as diverse functional molecules can be
modified onto specific sites of DNA structures/9'19-31 DNA
nanotechnology has enabled applications such as nanofabrication/3
protein structure determination,34 fluorescent
bioimaging,31'3* single-molecule biophysics/3 and modulation of
biosynthetic and cell-signaling pathways.936

Two methods that are particularly effective for assembling
discrete megadalton structures with arbitrarily prescribed
shapes are DNA origami,1014 L*r“ and single-stranded tiles
(SSTs)20 and bricks.21 In DNA origami, hundreds of short,
synthetic DNA strands fold a long scaffold (typically the M13
viral genome) into a desired structure.10"16-1 * * More recently,
researchers have demonstrated finite complex 2D and 3D
shapes self-assembled from hundreds to thousands of distinct
single-stranded tiles and bricks.20,21

In contrast to DNA origami, SST structures require no scaffold
strand and are composed entirely of short synthetic DNA
strands. The elimination of the scaffold strand makes SSTs
effective tools for systematic and rapid study of the geometry,
sequence, and structural design space for complex DNA
structures. First, as SST-based structures have a modular
architecture (each strand can be included, removed, or
replaced independently), SSTs enable rapid prototyping of
the shape space (more than 100 distinct, sophisticated 2D20
and 3D21 shapes were experimentally demonstrated). Second,
as the strand sequences are no longer restricted to be derived
from the biogenic scaffold sequence, SSTs enable rapid probing
of the sequence space (structures were formed from both
specifically designed and randomly generated sequences20,21).
Lastly and importantly, by removing the restriction of scaffold
routing through the shape and using diverse structural motifs
derived from the canonical SST motif,20 we demonstrate
here a systematic study of the structural design space for
forming complex DNA nanostructures.

To tailor the structures to function optimally for particular
applications, a broad design space for DNA nanostructures is
desired. Despite the many discrete and extended structures
demonstrated in the past few decades, the structural design
space has yet to be fully explored. In particular, complex,
finite-sized structures formed by origami and SST are primarily based
on double- or single-stranded versions of the basic DAE
(antiparallel double-crossover molecules with an even number
of half-helical turns) motif3 and their derivatives. Beyond
origami and SST, however, a richer motif space has been
explored for forming relatively simple discrete and extended
structures. This has been achieved using various structural
motifs, such as diverse double-crossover motifs/*4 PX-based
structures/ 3 structures with flexible linker joints/ l'14fI5j29r3B
wire-frame structures/ f2f7jS' 11,33-41 and metal-DNA
junctions.42

Figure 1. Motif designs and AFM imaging. A representative subset of the motifs studied is shown here; see SI, section SI1.3 for the full repertoire.
(a) A 2×1 unit (containing four SSTs) of the canonical motif (motif 0). Colors distinguish sequences. (b) Strand diagram (left), helix diagram
(middle), and cylinder diagram (right) of a 10 helix × 11 turn (10H×11T) structure formed from motif 0. (c) Cylinder diagrams for structures
formed from different motifs; a cylinder depicts a DNA duplex and a line depicts a single-stranded DNA segment: (c1) structures from motifs 0-2 and
4-10 have a rectangle shape containing parallel-arranged double-helices; (c2) structures from motifs 3.1 and 3.2 have a parallelogram shape
containing parallel-arranged double-helices; (c3) structures from motif 11 have a rectangle shape containing parallel-arranged double-helices
connected by single-stranded linkers; (c4) structures from motif 12 have a rectangle shape with a "fishnet" pattern (composed of short duplex
segments connected by single-stranded linkers); (c5) structures from motifs 13 and 14 have a rectangle shape with a honeycomb pattern composed
of short DNA duplex segments. (d) Strand diagrams and the corresponding AFM images for structures formed from different motifs. Top panel: a
2×2 repeating unit. Numbers indicate domain or linker lengths in nucleotides. Bottom panel: AFM image. Inset shows a magnified view. Scale bars,
50 nm. See SI, section SI1.5 for the measured dimensions and measured gel yields (5-44%).

The next challenge is to develop a versatile method for
creating complex, discrete-sized DNA nanostructures with
diverse structural motifs, strand-weaving patterns, and desired
structural properties. Building on the richness of the previous
structural motifs for building DNA nanostructures, and taking
advantage of the designability and flexibility of SST-based
assembly, we performed a comprehensive study of the design
space for constructing complex, finite-sized DNA structures.
We tested more than 30 distinct motifs derived from SSTs.20
Most motifs self-assembled to form structures with diverse
strand-weaving patterns and specific geometric properties, such
as curvature and twist. Moreover, we performed a systematic
study to control and characterize the curvature of the
structures, and constructed a flat structure with a corrugated
strand pattern. Our success in assembling a myriad of different
motifs demonstrates the robustness and versatility of the SST
approach. More generally, it reveals the broadness of the design
space for constructing complex DNA nanostructures.

■  RESULTS  AND  DISCUSSIONS 

Motif Design Parameters. We began our study of the
structural design space with the canonical U-shaped SST motif
(motif 0, Figure 1a,b) reported in our previous work.12,20 This
42-base SST motif has four consecutive domains (each
measuring 10 or 11 nt).12,20 By pairing up complementary
domains, individual tiles self-assemble into DNA lattices
composed of parallel-arranged double-helices connected by
periodic single-stranded crossover linkages. Note that the
linkage is merely a phosphate in the DNA backbone. In the
diagrams in Figure 1, the linkage is artificially stretched to show
the strand-weaving pattern more clearly. Starting with this
simple U-shaped motif, we systematically explored important
parameters for motif design. Below, we first give an overview of
some important parameters and their putative effects on the
geometrical, mechanical, and thermodynamic properties of the
structures. In the next section, we describe the motif designs and
their implementation in detail.

Domain Length. Motif 0 contains 10 and 11 nt domains.
For most of the motifs explored in our study (including motif
0), we tested several different domain lengths. Changing
domain length could affect (1) the geometrical properties of the
assembled structure, such as curvature* 13 and twist/ 43,44 (2)
thermodynamic properties, such as formation and melting
temperature; and (3) mechanical properties, such as structural
rigidity and internal strain.17,45 Domain length is thus a basic
design variable that we explored extensively in our study.
Changing domain length also directly affects the crossover
patterns described below.

Crossover Type. Structures formed from motif 0 possess
single-stranded, antiparallel crossovers. In our study, we
explored four combinations of single-12,44 vs double-stranded
crossovers, with parallel vs antiparallel orientations.3 Here we
use "parallel" (or "antiparallel") to describe crossovers in which
the orientation of the strand forming the crossover is the same
(or opposite) on both sides of the crossover.3 The crossover
type directly affects the weaving patterns of the strands and the
geometrical properties of the structure, which may in turn affect
their mechanical and thermodynamic properties. For example,
geometrically, the parallel or antiparallel crossovers result in
structures composed of double-helices that are connected in a
parallel or antiparallel fashion, referred to as parallel-connected
double-helices or antiparallel-connected double-helices. Note that
we also use the term parallel-arranged double-helices to refer to
the double-helices connected by either parallel or antiparallel
crossovers. In the antiparallel crossover-based structure, all the
crossovers display the major grooves on the same side of the
structure4 and, as a result, induce an accumulated curvature for
the structure. In contrast, the parallel crossover-based structure
avoids such asymmetry and curvature. In the antiparallel
crossover-based structure, the major groove of one double-helix
faces the minor groove of an adjacent double-helix;3 in contrast,
the parallel crossover-based structure has major-major and
minor-minor groove pairings. Hence, the former should have
less strain and more thermal stability.3 Compared with structures
based on double-stranded crossovers, those based on single-stranded
crossovers may be structurally less rigid, mechanically less
strained, and in turn, thermodynamically more stable.

Crossover Pattern. Motif 0 has a 21 nt crossover spacing for
crossovers between two adjacent double-helices; along one
particular double-helix, a crossover appears every 10 or 11 nt.
We systematically changed crossover patterns. Crossover
patterns could also lead to a combination of geometric,
thermodynamic, and mechanical effects. Geometrically, the
curvature of the overall structure (as determined by the
dihedral angle formed between three adjacent double-helices)
can be tuned by translating all crossover points along the
direction parallel to the helical axis while preserving the
spacing/ ' 3 twist, on the other hand, depends on the crossover
spacing.17,43,44 Mechanically, strain could accumulate in twisted
and/or curved structures.17,43,44 Such mechanical stress in turn
could affect the thermodynamic properties of the structure.

Symmetry.  By  adjusting  domain  length,  crossover  type,  and 
crossover  patterns,  we  can  design  structures  with  more 
symmetric  strand  weaving  patterns,  such  as  corrugated 
patterns/7  Such  symmetry  can  help  to  eliminate  curvature  in 
the  structure. 

Linkers. Structures formed from motif 0 are composed of
compact parallel-arranged double-helices connected by periodic
single-stranded crossovers (composed of a single phosphate in
the DNA backbone). In our study, we included single-stranded
linkers of varying lengths between helices and between binding
domains. This results in structures that are geometrically and
structurally more flexible. Inclusion of linkers should also help
to relieve the electrostatic repulsion between adjacent
double-helices, hence stabilizing the structure.

Structures Formed from Diverse Motifs. Overview. By
systematically exploring the above as well as some other design
parameters, we tested a total of 36 different motifs, 32 of which
formed desired structures (an 89% success rate). A
representative subset of all successful motif designs is shown
in Figure 1 (see SI, section SI1.3 for the full repertoire).
Different morphologies were presented, including compact
rectangles and parallelograms of packed DNA duplexes (Figure
1c1,c2), rectangles of spaced DNA duplexes (Figure 1c3), and
rectangles of square and hexagonal lattices (Figure 1c4,c5).

Figure 2. Curvature models. (a) Curvature model (left) and corresponding strand diagram (right) of motif 0. Inset depicts the major groove side vs
minor groove side. (b) Curvature model of motif 3.1 without (left) and with (right) offset angle α. (c) Curvature model of motif 3.2 with offset angle
α. (d) Curvature model of motif 4.1. In this corrugated design, the offset angle α is canceled out so the overall curvature is independent of α. (e,f)
Predicted cross-sectional views of 24-helix structures formed from motifs 0 and 4.1, respectively. See SI, section SI2.1 and Figure S139 for modeling
and calculation details.

Structure Assembly and Characterization. The DNA
structures were assembled from unpurified DNA strands with
randomly designed sequences, which were mixed at roughly
equimolar concentrations without careful adjustment of stoichiometry.
After single-step (one-pot) annealing from 90 to 25 °C over 17 h in
0.5×TE buffer supplemented with 15 mM Mg2+ (see SI, section
SI1.2 for the effects of buffer ion strength and annealing time
on the assembly yields), the solution was subjected to 2%
native agarose gel electrophoresis and produced one dominant
product band. The ratio between the fluorescence intensity of
the product band and that of the entire lane was used to
approximate the yield of structure formation, which was
measured to range from 5% to 44% (Figure SI 1).
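
As a small worked illustration of the yield estimate described above, the ratio can be written as a one-line function. The intensity values used in the usage note are made-up placeholders, and any background subtraction that may have been applied to the gel image is not described in the text and is not modeled here.

```python
def assembly_yield(product_band_intensity, lane_intensity):
    """Approximate yield as product-band intensity over whole-lane intensity."""
    if lane_intensity <= 0:
        raise ValueError("lane intensity must be positive")
    if not 0 <= product_band_intensity <= lane_intensity:
        raise ValueError("band intensity must lie between 0 and the lane intensity")
    return product_band_intensity / lane_intensity
```

For example, a hypothetical band of 440 intensity units in a lane totaling 1000 units would correspond to a yield of 0.44, the upper end of the range reported above.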

Annealed samples were subjected to atomic force microscopy
(AFM) imaging and showed the expected morphology and
dimensions. For samples with relatively low yields (motifs 4.6,
7.2, and 10), structures were gel-purified from the product
bands on a 2% agarose gel prior to AFM imaging. See SI,
section SI1.3, for larger AFM images and section SI1.5 for
measurements of the sizes of the structures.

Changing Domain Lengths: Motif 1. First, we adjusted the
domain lengths of two adjacent domains within a double-helix
from 10 or 11 nt to 9 or 12 nt (motif 1). This changes the
distance between adjacent crossovers along the same
double-helix, changing the dihedral angle
formed by three adjacent double-helices by 34°. This motif
was later used to build a larger structure for the curvature study.

Bidirectional  Motifs:  Motif  2.  Next,  we  flipped  the 
orientation  of  the  U-shaped  motifs  in  every  other  row,  resulting 
in  a  structure  composed  of  U-shaped  SSTs  with  alternating 
orientations  along  adjacent  rows.  Such  a  modification  makes 
the  overall  structure  more  symmetric  along  the  helical 
direction. 

Z-Shaped Motifs: Motifs 3.1 and 3.2. Structures formed
from U-shaped motifs have antiparallel crossovers. We then
designed structures with two Z-shaped motifs and obtained
structures containing parallel crossovers. Such parallel crossover
weaving patterns are expected to cancel out the curvature
resulting from the asymmetric arrangement of major and minor
grooves at the crossovers (see Curvature section for details)
and were used for the curvature study.

Corrugated Structures: Motif 4.1. We further changed the
orientation of every other row in the Z-shaped motif to form a
symmetric structure with a corrugated29 strand pattern. This
corrugation should in principle eliminate any curvature and
result in a perfectly flat structure. This structure was used in the
curvature study below.

Twisted Structures: Motifs 4.2-4.6. Based on the
corrugated structure from motif 4.1, we changed the crossover
spacing from the canonical two full helical turns (i.e., 21 nt) to
20, 22, 24, 26, and 18 nt using motifs 4.2-4.6, respectively. This
introduced a significant internal twist to the structure.17,43,44
The structures with 20 nt crossover spacing (motif 4.2) were
used for the twist study and significant twist was observed (see
below). The yield of the structure with the shortest spacing
(motif 4.6) was relatively low (26%) compared to that of the
rest of the group (33-42%).
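
To give a feel for the size of the imposed twist, the sketch below computes how far each crossover spacing departs from a whole number of helical turns. It assumes 10.5 nt per turn, which follows from the statement above that 21 nt correspond to two full turns; comparing every spacing against two turns is an illustrative simplification, and the function name is hypothetical. How this local mismatch is distributed into the global twist of a real structure is not modeled.

```python
NT_PER_TURN = 21 / 2  # 21 nt span two full helical turns (from the text)


def twist_mismatch_deg(spacing_nt, reference_turns=2):
    """Angular over- or underwinding (degrees) per crossover interval."""
    return 360.0 * (spacing_nt / NT_PER_TURN - reference_turns)


# Crossover spacings of motif 4.1 (canonical) and motifs 4.2-4.6.
SPACINGS_NT = {"4.1": 21, "4.2": 20, "4.3": 22, "4.4": 24, "4.5": 26, "4.6": 18}

mismatch_by_motif = {m: twist_mismatch_deg(s) for m, s in SPACINGS_NT.items()}
```

Under these assumptions the canonical 21 nt spacing gives zero mismatch, while a 1 nt change (motifs 4.2 and 4.3) corresponds to roughly 34° per interval, with opposite signs for shorter and longer spacings.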

Double-Stranded Crossovers: Motifs 5.x, 6.x, and 7.x.
Structures formed from motifs 0-4 contain single-stranded,
half crossovers between adjacent double-helices. We next tested
structures with double-stranded, full crossovers. We first
shifted the nicking points in the original U-shaped motif
away from the crossovers (motifs 5.1-5.3). Then we
designed H-shaped motifs by increasing the crossover
spacing from two helical turns to four helical turns (motifs
6.1-6.3). We also designed X-shaped motifs that were derived from
Z-shaped motifs (motifs 7.1 and 7.2). Structures with full
crossovers have nicks on the duplex (rather than at the
crossovers), which could permit the use of enzymatic ligation to
covalently join the two ends of an SST molecule to form a ring.

Hybrid  Motifs:  Motifs  8-10.  We  combined  different  motifs 
to  form  hybrid  structures  with  both  U-shaped  and  Z-shaped 
rows  (motifs  8  and  9)  or  with  H-shaped  and  X-shaped  rows 
(motif  10). 

Linkers at Crossovers: Motifs 11.1-11.4. At the crossovers,
we placed single-stranded DNA linkers with varying lengths (20
nt, 10 nt, and 2 nt for motifs 11.1, 11.3, and 11.4, respectively).
We also formed structures with 15 bp double-stranded
linkers (motif 11.2). In addition to the 10H×11T structure, we
also used motif 11.1 to form a 24H×29T structure (Figure
SI29), demonstrating the scalability of structures formed from
linker motifs.

Linkers between Neighboring Domains: Motif 12. We
formed structures from a motif with linkers between every pair
of neighboring domains (motif 12). In the resulting structures,
individual 10 or 11 bp double-stranded DNA duplexes were
connected by flexible 10 nt, single-stranded poly-T linkers. The
structure was thus likely loosely connected without well-defined
morphology. AFM imaging was performed only after the
addition of 10 nt poly-A strands, which complemented the
poly-T linkers and helped to give the
structure a better defined shape. After poly-A complementation,
grid-like structures were observed.

Figure 3. Curvature characterization. Columns: motifs 0, 1, 3.1, 3.2, and 4.1. See SI, section SI2.1 for curvature calculation and modeling details. (a) Strand diagrams (top) and curvature
models in cross section views (bottom). (b) AFM images. Scale bars, 100 nm. See SI, section SI2.3 for larger images. (c) TEM images. Scale bars,
100 nm. See SI, section SI2.4 for larger images. (d) AFM-based landing assay results. A structure that landed with its front side facing up is marked
with a blue dot inside a red circle; the one with back side facing up is marked by a hollow red circle. Magnified views show the structures with
orientational markers (front side structures in blue boxes and back side structures in red boxes). The ratio between two landing orientations is shown
in the bar chart in the bottom panel (blue, front side; red, back side). See SI, section SI2.5 for larger images and more details. (e) Representative
DNA-PAINT super-resolution images of different SST structures labeled at four corners (scale bars, 100 nm). Measured point distances for motifs 0,
1, 3.1, 3.2, and 4.1 were 73 nm, 27 and 73 nm (short and long axis), 31 and 154 nm (parallelogram short and long diagonal), 45 and 148 nm
(parallelogram short and long diagonal), and 45 and 77 nm (short and long axis), respectively. (f) Schematics of AFM-based landing assay. (g)
Schematics of DNA-PAINT super-resolution imaging. Left panel: an SST structure was labeled on the top surface with 9 nt single-stranded
"docking" sites for DNA-PAINT imaging strands. It was also labeled on the bottom surface with biotinylated strands for surface attachment. Center
panel: imager strands bind transiently to the docking sites in the four corners, producing apparent blinking. Right panel: diffraction-limited TIRFM
image (left) and super-resolution image (right). Scale bars, 100 nm.

Wire-frame Lattices: Motifs 13 and 14. We constructed
hexagonal lattices with different mesh sizes: one formed from a
motif with three 21-nt domains (motif 13) and another from a
motif with six 10-nt domains (motif 14). The hexagon pattern
is similar to previous work on hexagon lattice-based tubes.43 In
contrast to the typical structures composed of parallel-arranged
double-helices linked by periodic (half) crossovers, these
wire-frame structures40,41 are composed of short duplex fragments
connected with junction points. Interestingly, due to the
electrostatic repulsions between adjacent DNA double-helices,
a structure composed of parallel-arranged double-helices
typically possesses a gap (e.g., 0.5-1.5 nm in DNA
origami 0,1 ,l 6 and SST structures 2"20'21) between the adjacent
crossovers, and can be viewed as a wire-frame structure with a
rectangular mesh pattern. In SI, section SI1.4, we discuss the
transformation between parallel pattern and rectangular
wire-frame pattern in more detail.

Curvature. Modeling and Design. The weaving
pattern of the strands can affect the curvature of the structure.
Figure 2 summarizes our strategies to control structure
curvature. SI, section SI2.1 gives modeling and analysis details.

Domain Length Adjustment. Working toward our goal of
controlling curvature to design a flat structure, we first
attempted to reduce curvature by adjusting the domain length
of the canonical U-shaped SST motif. Assuming that the
crossover points between adjacent double-helices are collinear
with the centers of the adjacent double-helices, our model
predicts an average curvature of 30° per double-helix for
motif 0, and 720° for the 24H×29T rectangle (see Figure 2a,e
for cross-sectional views of the zoomed-in structure and the overall
structure; see SI, section SI2.1.1 for detailed calculation). This
curvature was confirmed by our experimental data (Figure 3,
column motif 0).

By adopting a domain length of 9/12 nt (motif 1) instead of
the canonical 10/11 nt in motif 0, we were able to reduce the
calculated average curvature per double-helix from 30° to −4°,
and the total to −96° for the 24H rectangle (see SI, section SI2.1.1 for
detailed calculation). This reduced curvature was confirmed by
experimental data (Figure 3, column motif 1).
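
The totals quoted above follow from multiplying the per-helix curvature by the number of helices. A minimal sketch of that bookkeeping, assuming curvature simply accumulates linearly across the 24 helices (the function name is illustrative and not taken from the SI):

```python
def total_curvature_deg(per_helix_deg: float, n_helices: int) -> float:
    """Accumulated curvature, assuming each helix adds the same angle."""
    return per_helix_deg * n_helices


# Per-helix values are the ones quoted in the text for motifs 0 and 1.
for motif, per_helix in (("motif 0", 30.0), ("motif 1", -4.0)):
    total = total_curvature_deg(per_helix, n_helices=24)
    print(f"{motif}: {total:+.0f} degrees ({total / 360:+.2f} full turns)")
```

Under this assumption, motif 0 accumulates two full turns across the rectangle, which matches the rolled-up appearance described for that structure.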

However, a 2D structure (such as that formed by motif 0 or
motif 1) designed with antiparallel crossovers will always
possess  a  side  that  displays  minor  grooves  at  crossovers  and 
another  side  that  displays  major  grooves  at  crossovers.4'  As 
such,  the  curvature  will  not  be  perfectly  counterbalanced  for 
antiparallel  crossover-based  structures  designed  from  U-shaped 
motifs  (Figure  2a). 

Parallel Crossover Design. Next, we hypothesized that the
curvature induced by the asymmetric arrangement of major and
minor grooves at the crossovers on the two sides of the
structures would be negated in parallel crossover-based
structures3 formed from Z-shaped motifs; here, on either side
of the structure, the crossovers display major and minor
grooves in an alternating fashion. As with the U-shaped motif, we
assumed that the crossover points of adjacent double-helices are
collinear with the centers of the adjacent double-helices, and
designed a Z-shaped motif (motif 3.1)-based structure that is
expected to be perfectly flat (see Figure 2b, left panel; see SI,
section SI2.1.2 for detailed calculation). However, our
experimental data revealed significant curvature in the structure
(Figure 3, column motif 3.1).

The discrepancy between our model and our experiments led
us to revise the previous assumption that the crossover points
of adjacent double-helices are collinear with the centers of the
adjacent double-helices. Although our experimental results
from U-shaped motifs (motifs 0 and 1) did not invalidate such
an assumption, the results from Z-shaped motifs were clearly
incompatible with this collinear assumption. To reconcile this
discrepancy between the model and the experimental results,
we introduced an offset angle α formed between a base at the
crossover and the two center points of the adjacent double-helices
(middle panels of Figures 2b and SI39b). In a structure
formed from motif 3.1, this offset angle at the crossover always
appears on the same side of the structure, resulting in an
accumulative curvature for this otherwise flat structure.

Incidentally, another structure based on a Z-shaped motif
with a different domain length (motif 3.2) appeared
approximately flat in our experiments (Figure 3, column
motif 3.2). Assuming this structure (motif 3.2) has 0° curvature,
we back-calculated an approximate value for the offset angle α
(8.5°, Figure 2c). Using this α value, the 24H×29T structure
formed from motif 3.1 was expected to have significant
curvature (see Figure 2b, middle panel; see SI, section SI2.1.2
for detailed calculation), consistent with the experimental
observation of a curved structure.

Corrugation Design. We next devised a corrugated design
(motif 4.1) where the offset angle α appears alternately on the
two sides of the structure, thus canceling out the curvature
effect (independent of its value, Figure 2d,f; see SI, section
SI2.1.3 for detailed calculation). The predicted flatness of the
structure was experimentally verified (Figure 3, column motif
4.1).
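
The cancellation argument can be illustrated with a toy sum of signed offsets. This is only a sketch and not the SI calculation: it assumes one offset contribution per helix (24 in total), ignores every other curvature term, and uses the back-calculated α = 8.5° purely as an example value.

```python
def net_offset_curvature_deg(sides, alpha_deg):
    """Sum the offset angle over all crossovers.

    `sides` holds +1 or -1 for the side of the structure on which the
    offset appears at each crossover (a simplifying assumption).
    """
    return sum(side * alpha_deg for side in sides)


ALPHA_DEG = 8.5   # value back-calculated in the text from motif 3.2
N = 24            # assumed: one offset contribution per helix

same_side = [+1] * N                          # motif 3.1-like arrangement
alternating = [(-1) ** i for i in range(N)]   # corrugated arrangement

print(net_offset_curvature_deg(same_side, ALPHA_DEG))
print(net_offset_curvature_deg(alternating, ALPHA_DEG))
```

Because the alternating signs cancel pairwise, the second sum vanishes for any value of α, which is the sense in which the corrugated design is independent of the offset angle.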

Curvature: Experimental Characterization. We performed
a case study to test the curvature of 24H×29T
rectangle structures formed from a number of motifs.

Characterization by AFM and TEM. To study the curvature
of the structures, we constructed 24H×29T structures with
each of the above five motifs. Sequence design and structure
assembly were conducted in a fashion similar to that of their 10H×11T
counterparts. The annealed samples were then subjected to 2%
native gel electrophoresis (see Figure SI41 for gel data), and the
product band was extracted, purified by centrifugation, and
imaged with AFM and TEM. Under AFM, all the structures
appeared as flat, single-layer structures on the mica surface (Figure
3b); no curvature information was revealed. However, TEM
images more effectively reflected the structure configurations in
solution (Figure 3c): the structure generated from motif 0
(720° curvature expected) appeared as a dense, rolled-up
rectangle; the structure from motif 1 (−96° expected) appeared
as a wider rectangle; rolled-up and flat parallelograms were
observed for the structures made from motifs 3.1 and 3.2,
respectively; and a flat rectangle was observed for the structure
from motif 4.1 (0° curvature expected).

AFM-Based Landing Assay. The different curvature
configurations of the structures were also confirmed by two
innovative assays. The first was the AFM-based landing assay20
in which a small hole was introduced at the top-left corner of
the structure as an orientational marker (Figure 3f). If the
marker was more likely to show up at the top-left corner rather
than its mirror image (top-right corner), it indicated that the
structure landed on the mica with its front side facing up,
suggesting the structure curved up instead of down (motif 0).
On the contrary, if the mirror image (back side facing up) was
more likely, the structures had the tendency to curve down
(motif 3.1). If the marker showed no bias for either side, the
structure would be flat. Our experimental results (Figure 3d)
showed a perfect bias for the rolled-up structures in the
predicted curving direction [motif 0, 100% front side facing up
(N = 36); motif 3.1, 100% back side facing up (N = 57)].
However, possibly due to a limited sample size, the assay did
not effectively differentiate between other less curved structures
[motif 1, 65% front side facing up (N = 74); motif 3.2, 42%
front side facing up (N = 45); motif 4.1, 66% front side facing
up (N = 203)]. Note that only well-formed structures (defined
as those showing no defects >15 nm in diameter in the
expected boundary, nor >10 nm in diameter in the interior of
the structure) were counted in our statistics. See SI, section
SI2.5 for details.
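
One way to put a number on "bias" in a landing assay is an exact two-sided binomial test against a 50/50 split. The sketch below is not part of the original analysis; the front-side counts are reconstructed from the rounded percentages and sample sizes quoted above, so they are approximations, and the choice of test is an assumption.

```python
from math import comb


def two_sided_binomial_p(k: int, n: int) -> float:
    """Exact two-sided p-value for k successes in n trials under p = 0.5.

    Sums the probabilities of all outcomes that are no more likely than
    the observed one.
    """
    probs = [comb(n, i) * 0.5 ** n for i in range(n + 1)]
    observed = probs[k]
    # Small relative tolerance guards against floating-point ties.
    return min(1.0, sum(p for p in probs if p <= observed * (1 + 1e-9)))


# (motif, approximate front-side-up count, N); counts are rounded
# reconstructions from the reported percentages, not raw data.
assay = [
    ("motif 0", 36, 36),
    ("motif 3.1", 0, 57),
    ("motif 1", round(0.65 * 74), 74),
    ("motif 3.2", round(0.42 * 45), 45),
    ("motif 4.1", round(0.66 * 203), 203),
]

for motif, front_up, n in assay:
    p = two_sided_binomial_p(front_up, n)
    print(f"{motif}: {front_up}/{n} front side up, p = {p:.2g}")
```

The p-values only measure departure from an even split; whether a given departure reflects curvature is a separate question that the test does not answer.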

Super-Resolution Fluorescence Microscopy Assay. Super-resolution
fluorescence microscopy was the second approach
employed to assess the curvature of SST structures (Figure 3g).
Here we used DNA-PAINT4*32'35 to obtain sub-diffraction
images of SST structures. DNA-PAINT exploits the repetitive,
transient binding of short fluorescently labeled oligonucleotides
("imager" strands) to complementary strands ("docking" sites)
on a structure or molecule of interest. Using total internal
reflection fluorescence microscopy (TIRFM), binding events of imager
strands to docking sites were observed as single-molecule
events and the fluorescence emission was fitted to a 2D
Gaussian function, yielding sub-diffraction resolution images.
SST structures were "labeled" with DNA-PAINT docking sites
by extending the three outermost strands in each of the four
corners by a 9 nt long sequence at their 3'-end (Figure 3g,
center panel). The structures also carried four biotinylated
strands pointing toward the opposite side of the docking
strands for surface attachment (Figure 3g, left panel). Once the
structures were bound to a functionalized glass slide, fluorescently labeled
DNA-PAINT imager strands were introduced, which transiently
bound to the docking strands. The right panel of Figure
3g includes a typical diffraction-limited TIRF image of a DNA
nanostructure alongside the DNA-PAINT image after
reconstruction. The increase in resolution is clearly visible.

The results of the DNA-PAINT study on the five different
SST motifs are in good agreement with the TEM analysis as
well as the AFM-based landing assay (Figure 3e). For motif 0,
only two apparent points at a distance corresponding to the
long side of the rectangular structure were visible, suggesting a
"rolled-up" structure where the two corners were placed right
next to each other, or only one of the two corners was available
to bind the imager strand. In contrast, in the structure from
motif 1 we observed four separate points, two of which were in
close proximity, suggesting a slightly curved structure. Images
for motifs 3.1 and 3.2 were consistent with an almost rolled-up
structure (3.1) and a flat parallelogram structure (3.2),
respectively. Finally, the image for motif 4.1 (with four visible
corners) suggests a flat rectangular structure.
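
The sub-pixel localization step behind these images can be sketched as follows. The experiments fitted each single-molecule spot to a 2D Gaussian; the snippet instead uses an intensity-weighted centroid on a synthetic, noise-free spot, which is a simpler stand-in chosen to keep the example dependency-free. The grid size, spot width, and positions are arbitrary illustrative values.

```python
import math


def gaussian_spot(size, x0, y0, sigma, amplitude=1000.0):
    """Noise-free 2D Gaussian sampled on a size x size pixel grid."""
    return [
        [amplitude * math.exp(-((x - x0) ** 2 + (y - y0) ** 2) / (2 * sigma ** 2))
         for x in range(size)]
        for y in range(size)
    ]


def centroid(image):
    """Intensity-weighted centroid (x, y) in pixel units."""
    total = sum(sum(row) for row in image)
    cx = sum(x * v for row in image for x, v in enumerate(row)) / total
    cy = sum(y * sum(row) for y, row in enumerate(image)) / total
    return cx, cy


# A spot centered between pixels is recovered to well below one pixel.
spot = gaussian_spot(size=15, x0=7.3, y0=6.8, sigma=1.5)
cx, cy = centroid(spot)
print(f"estimated center: ({cx:.2f}, {cy:.2f})")
```

With real data, background and shot noise make a least-squares or maximum-likelihood Gaussian fit preferable to a plain centroid.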

Twist. Besides assessing the curvature in different SST
designs,  we  also  studied  the  twist  of  the  SST  structures  by 
creating  homopolymers  of  motifs  4.1  and  4.2.  We  designed  the 
leftmost  column  of  the  SST  tiles  to  pair  with  the  rightmost 
column  of  the  rectangle.  As  a  consequence,  multiple  units  of 
rectangles  align  in  tandem  to  form  a  long  polymer  with 
individual  rectangles  as  monomer  units  (Figure  SI58,  top). 
Polymerization  helps  to  identify  the  small  degrees  of  twist  that 
would  be  difficult  if  not  impossible  to  visualize  using  only 
monomeric  structures.1  ,l44  AFM  images  revealed  that  the 
homopolymers formed from motif 4.1 yielded ribbons with
virtually no global twist (Figure SI58a, bottom). This was
expected, as the design used 21 bases between adjacent
crossover points, thus obeying the 10.5 bp spacing per helical turn
of natural B-DNA. In contrast, polymers formed
from motif 4.2 showed a global left-handed twist (Figure SI58b,
bottom). This is consistent with the fact that motif 4.2 used 20
bases between adjacent crossover points and was expected to
produce a locally overwound double-helix. Our results agree
with  earlier  studies  of  curvature  and  twist  for  DNA  origami 
structures17'43,44  and  suggest  that  the  twist  of  the  SST 
structures  can  be  modulated  by  changing  crossover  spacing 
distances. 
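
The link between crossover spacing and local winding can be made explicit with a short calculation. This sketch assumes that the spacing between adjacent crossover points must span exactly two helical turns, which is consistent with 21 bases at 10.5 bp per turn; the function name is illustrative.

```python
NATURAL_BP_PER_TURN = 10.5  # B-DNA value quoted in the text


def twist_mismatch_deg(bases_between_crossovers: int, turns: int = 2) -> float:
    """Extra rotation the helix must absorb to complete `turns` full turns.

    Positive values mean the segment has to wind more tightly than
    relaxed B-DNA (locally overwound); zero means no strain.
    """
    relaxed_rotation = bases_between_crossovers * 360.0 / NATURAL_BP_PER_TURN
    return 360.0 * turns - relaxed_rotation


for motif, bases in (("motif 4.1", 21), ("motif 4.2", 20)):
    print(f"{motif}: {bases / 2:.1f} bp/turn imposed, "
          f"mismatch {twist_mismatch_deg(bases):.1f} degrees per two turns")
```

A spacing of 21 bases gives no mismatch, while 20 bases leaves roughly 34° to be absorbed per two turns, in line with the locally overwound helix described for motif 4.2.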

■  CONCLUSION 

In  summary,  we  evaluated  more  than  30  SST  motifs  that  form 
lattice  structures  with  diverse  strand  weaving  patterns.  Most  of 
the  motifs  self-assembled  into  the  designed  structures  with 
reasonable  yields.  Using  some  of  the  motifs,  a  systematic  study 
was  conducted  to  reduce  the  curvature  in  the  assembled 
structure. The difference in curvature was characterized by
TEM, an AFM-based landing assay, and super-resolution
microscopy. A corrugated weaving pattern was predicted and
experimentally verified to produce a flat structure.

The simple and robust nature of SSTs has enabled us to
survey the structural space for forming complex structures in a
rapid and comprehensive fashion. An unprecedented collection
of diverse weaving patterns was demonstrated. We were able to
define simple-to-implement design rules that lead to structures
with specific geometric properties such as curvature and twist.
More broadly, the diverse collection of motifs will allow us to
modulate the geometrical, mechanical, and thermodynamic
parameters. We demonstrated the tuning of curvature and twist
with the new motifs. In a related work,49 we demonstrated that
by using the motifs reported here, we can form complex DNA
nanostructures isothermally at any prescribed temperature
between 15 and 69 °C, and under biocompatible conditions.
The work here thus provides a new set of tools for modulating
the structural properties of complex DNA structures, and
reveals the broad design space for forming such structures.

■  METHODS 

Finite DNA nanostructures were first designed with different motifs,
and then random sequences were generated to fill in the specific
structures and populate complementary domains. Without careful
adjustment of stoichiometry, unpurified strands were mixed and
supplemented with 0.5×TE and 15 mM Mg2+. After one-pot annealing
from 90 to 25 °C over 17 h, the solution was subjected to native
agarose gel electrophoresis. The annealed samples, or the gel-purified
samples from extraction of the desired band by centrifugation, were
imaged with AFM or TEM. The curvature and twist of certain
structures were studied by AFM, TEM, and super-resolution
fluorescence imaging.

DNA Sequence Design. Most DNA sequences were designed
with the Uniquimer software by populating the motifs with random
sequences50 while maintaining the required complementarity relations.
The following design rules were applied for sequence generation: (1)
Nucleotides (i.e., A, C, G, T) are generated one by one randomly. (2)
Nucleotides complementary to those generated are matched following
the base-pairing rule: A to T, C to G, and vice versa. (3) Specific
segments (e.g., four consecutive A, C, G, or T) are not allowed. When
such segments emerge during design, the most recently generated
nucleotide is mutated until the disallowed segment no longer
appears. Due to the difficulty of designing structures with wire-frame
motifs, a separate script was developed following the same design
principles. Manual design and/or optimization was used for the linker
segment sequences (e.g., motifs 11.1–11.4 and motif 12) and the
handle segment sequences (e.g., handle segment sequences for
structures of motifs 0, 1, 3.1, 3.2, and 4.1 in the super-resolution study).
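
Rules (1) to (3) can be sketched in a few lines. This is not the Uniquimer implementation or the authors' script; it is a minimal illustration that treats "four consecutive A, C, G, or T" as a run of four identical nucleotides, and all function names are hypothetical.

```python
import random

COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}
MAX_RUN = 3  # a run of four identical nucleotides is disallowed


def random_domain(length: int, rng: random.Random) -> str:
    """Rule (1) and rule (3): draw bases one by one, re-drawing the most
    recent base whenever it would complete a forbidden run."""
    seq = []
    while len(seq) < length:
        base = rng.choice("ACGT")
        while len(seq) >= MAX_RUN and all(b == base for b in seq[-MAX_RUN:]):
            base = rng.choice("ACGT")  # mutate the newest nucleotide
        seq.append(base)
    return "".join(seq)


def reverse_complement(seq: str) -> str:
    """Rule (2): the partner domain follows A-T and C-G pairing."""
    return "".join(COMPLEMENT[b] for b in reversed(seq))


rng = random.Random(0)  # fixed seed so the sketch is reproducible
domain = random_domain(21, rng)
partner = reverse_complement(domain)
print(domain, partner)
```

The reverse complement inherits the run-length property automatically. Runs that form across the junction between two concatenated domains are not checked here and would need an extra pass.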

Sample Preparation. DNA strands were synthesized by
Integrated DNA Technologies, Inc. (http://www.idtdna.com). To
assemble the structures, DNA strands were mixed to a roughly equal
molar concentration of 200 nM for smaller structures (all 10H×11T
structures, structures with linkers, and wire-frame structures) and 100
nM for larger structures (24H×29T and 24H×28T structures for the
curvature study) in 0.5×TE buffer (5 mM Tris, pH 7.9, 1 mM EDTA)
supplemented with 15 mM MgCl2. The mixture was then annealed in
a PCR thermocycler by cooling from 90 to 25 °C over a period of 17
h with a specific cooling program.20 The annealed samples were then
subjected to 2% agarose gel electrophoresis (gel prepared in 0.5×TBE
buffer supplemented with 10 mM MgCl2 and prestained with SYBR
Safe) in an ice water bath. If necessary, the target gel bands were
excised and put into a Freeze 'N Squeeze column (Bio-Rad). The
gel pieces were crushed using a microtube pestle in the column and
the column was then centrifuged directly at 433g for 3 min.
Purified samples were collected in the eluate, and concentrations were
determined by Nanodrop absorption at 260 nm prior to AFM or TEM
imaging.
AFM Imaging. AFM images were obtained using an SPM
Multimode with Digital Instruments Nanoscope V controller
(Veeco). A 5 μL droplet (2–10 nM) of annealed (or purified) sample
and then a 40 μL drop of 0.5×TE/10 mM MgCl2 solution were
applied to a freshly cleaved mica surface and left for approximately 2
min. Sometimes, additional dilution of the sample was performed to
achieve the desired sample density. For the 10H×11T
structures, 20 μL of supplemental 10 mM NiCl2 was added to increase
the strength of DNA–mica binding.51 The images were taken under
the liquid tapping mode, with C-type triangular tips (resonant
frequency, f0 = 40–75 kHz; spring constant, k = 0.24 N m−1) from the
SNL-10 silicon nitride cantilever chip (Bruker Corp.).

TEM Imaging. For TEM imaging, a 3.5 μL sample (1–5 nM) was
adsorbed onto glow-discharged carbon-coated TEM grids for 4 min
and then stained for 1 min or a few seconds using a 2% aqueous uranyl
formate solution containing 25 mM NaOH. Imaging was performed
using a JEOL JEM-1400 TEM operated at 80 kV.

Super-Resolution Imaging. Super-resolution studies were
performed with DNA-PAINT.43 The samples were diluted in DNA-PAINT
imaging buffer (5 mM Tris-HCl (pH 8.0), 10 mM MgCl2, 1
mM EDTA, 0.05% Tween-20) to 50 pM and immobilized on a
coverslip in a flow chamber via biotin–streptavidin binding.
To fix the samples, solutions containing 1 mg/mL BSA-biotin, 2 mg/mL
streptavidin, and biotin-labeled DNA nanostructures were flushed
into the flow chamber and incubated in the above order. The chamber
was then washed with DNA-PAINT imaging buffer containing 20 nM
ATTO655-labeled imager strands and sealed with 5 min epoxy before
imaging. DNA-PAINT super-resolution images were acquired on an
inverted Nikon Ti-E microscope (Nikon Instruments, Melville, NY)
with the Perfect Focus System, applying an objective-type TIRF
configuration using a Nikon TIRF illuminator with an oil-immersion
objective (100× Plan Apo, NA 1.49, Oil, Nikon). A 647 nm laser
(Agilent MLC400B, 80 mW at the objective) was used for TIRF
excitation. The laser beam was filtered with a clean-up filter (642/20,
Chroma Technologies) and coupled into the microscope objective
using a multiband beam splitter (zt405/488-491/561/638rpc, Chroma
Technologies). Fluorescence was spectrally filtered with an emission
filter (700/75, Chroma Technologies) and imaged on an EMCCD
camera (Andor iXon 3, Andor Technologies, Northern Ireland). A total of
5000 frames were recorded at a frame rate of 10 Hz. Super-resolution
images were reconstructed using spot-finding and 2D Gaussian fitting
algorithms programmed in LabVIEW (National Instruments), available
for download at www.dna-paint.org.

Yield Quantification by Gel Electrophoresis. Yield was
estimated by analysis using native agarose gel electrophoresis,
prestained with SYBR Safe DNA stain. The ratio between the
fluorescence intensity of the product band and that of the entire
lane was taken as an estimate of the gross yield of structural formation.
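
The yield estimate described above reduces to a ratio of summed intensities. A minimal sketch, assuming a one-dimensional lane intensity profile, a constant background level, and a band given as a slice of that profile (all of these are illustrative choices, and the numbers are made up):

```python
def gel_yield(lane_profile, band_start, band_end, background=0.0):
    """Fraction of background-corrected lane intensity inside the band.

    `band_start` and `band_end` are slice indices into `lane_profile`.
    """
    corrected = [max(v - background, 0.0) for v in lane_profile]
    lane_total = sum(corrected)
    if lane_total == 0:
        raise ValueError("lane has no signal above background")
    return sum(corrected[band_start:band_end]) / lane_total


# Hypothetical intensity profile along one lane (arbitrary units).
profile = [2, 3, 11, 21, 11, 3, 2, 6, 6]
print(f"{gel_yield(profile, band_start=2, band_end=5, background=1.0):.2f}")
```

The result depends on how the background and the band boundaries are chosen, so those choices should be kept fixed when comparing lanes.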

Structure Size Measurement and Statistics. AFM measurements
were obtained using Nanoscope Analysis (version 1.20; Veeco).
The "cross-section" function was used to measure distances (lengths
and widths of the structures of different sizes). Well-formed structures
were chosen for the measurements. TEM images were analyzed using
ImageJ (version 1.46r; NIH). The straight line function was used to
measure widths of certain structures. Six or ten sample points were
collected for each distance measurement and the statistics (e.g., the
mean and the standard deviation) were based on those 6 or 10 data
points.
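
The summary statistics for a set of six or ten measurements can be computed as below. The values are hypothetical placeholders rather than data from the study, and the use of the sample (n − 1) standard deviation is an assumption, since the text does not say which form was used.

```python
from statistics import mean, stdev


def summarize(measurements_nm):
    """Return (mean, sample standard deviation) of the measurements."""
    if len(measurements_nm) < 2:
        raise ValueError("need at least two measurements")
    return mean(measurements_nm), stdev(measurements_nm)


# Hypothetical widths in nm for six sample points.
widths_nm = [60.0, 62.0, 61.0, 59.0, 63.0, 61.0]
avg, sd = summarize(widths_nm)
print(f"width = {avg:.1f} ± {sd:.1f} nm (n = {len(widths_nm)})")
```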
Experimental details, additional figures, and sequence data for
ABSTRACT: Nucleic acid nanotechnology has enabled researchers to construct a
wide range of multidimensional structures in vitro. Until recently, most DNA-based
structures were assembled by thermal annealing using high magnesium
concentrations and nonphysiological environments. Here, we describe a DNA
self-assembly system that can be tuned to form a complex target structure isothermally at
any prescribed temperature or homogeneous condition within a wide range. We were
able to achieve isothermal assembly between 15 and 69 °C in a predictable fashion by
altering the strength of strand–strand interactions in several different ways, for
example, domain length, GC content, and linker regions between domains. We also
observed the assembly of certain structures under biocompatible conditions, that is,
at physiological pH, temperature, and salinity in the presence of the molecular
crowding agent polyethylene glycol (PEG) mimicking the cellular environment. This
represents an important step toward the self-assembly of geometrically precise DNA
or RNA structures in vivo.

KEYWORDS: DNA nanotechnology, isothermal assembly, single-stranded tiles, biocompatible assembly, molecular crowding


Nucleic  acid  self-assembly  has  proven  to  be  a  powerful  tool 
for  constructing  nanoscale  structures  due  to  the  precise 
and  predictable  relationship  between  structure  geometry  and 
the  encoded  sequence.  Over  the  past  few  decades,  a  wide  range 
of DNA- and RNA-based nanostructures have been demonstrated,
including a variety of two-dimensional (2D) and three-
dimensional  (3D)  geometries1-14  and  dynamic  devices.1 3  ~z" 
Additionally,  such  nanostructures  can  interface  with  and 
organize  molecules,  cells  and  other  materials/"12'23-27  thereby 
enabling biophysical24,28 and potential biomedical23 ~9r3°
applications. The self-assembly of individual strands into
DNA  nanostructures  typically  involves  a  thermal  gradient  in 
which the system temperature is first raised (e.g., to 80–90 °C)
and  then  gradually  lowered  to  room  temperature.  This  thermal 
gradient  can  be  replaced  with  a  chemical  gradient  of  a 
denaturing agent to isothermally assemble precisely controlled
DNA origami structures.31 In addition to gradient-based
assembly,  researchers  have  demonstrated  the  isothermal 
formation  of  extended  crystals1^3233  using  DNA  and  RNA 
tiles  and  discrete  structures  using  DNA  origami,34  both  under 
homogeneous  conditions. 

Isothermal  assembly  of  DNA  tiles  and  origami  under 
homogeneous  conditions  represents  important  progress  for 
the  assembly  of  complex  DNA  nanostructures.  Compared  with 
thermal  annealing,  assembling  structures  in  homogeneous 
conditions  leads  to  rapid  structure  formation  with  higher 
yield  and  quality.34  However,  previous  work  on  DNA  tile 
crystals  and  origami  still  requires  a  highly  optimized  set  of 
assembly conditions (temperature, salinity, etc.) that are
structure-dependent and restricted to a narrow range, thereby
limiting the scope of potential applications. In contrast,
assembly across a wide range of conditions will increase the
range of applications of DNA nanotechnology, especially when
the assembly conditions are specified by the application, such as
in vivo scaffolding12 of metabolic enzymes (where the assembly
is required to happen under intracellular conditions) or in situ
imaging33 (where the assembly is required to happen under
harsh denaturing conditions). Thus, the next important
challenge is to devise a general method in which, given a
particular homogeneous condition and a target shape, the user
can design a system that assembles under that particular
condition into the desired shape, e.g., the isothermal assembly
of a particular shape at any prescribed temperature over a
wide temperature range. Such capability will greatly expand the
application  scope  where  an  assembly  system  can  be  designed  to 
operate  under  diverse  homogeneous  conditions  specified  by  the 
desired  application.  Here  we  provide  a  general  solution  to  this 
challenge. 

The  key  innovation  in  our  approach  is  that  we  exploit  the 
tunable  nature  of  single-stranded  tile  (SST)  structures  and 
demonstrate structural designs that can assemble isothermally
across a wide range of temperatures and salt concentrations,
including conditions mimicking intracellular environments.
Compared  with  DNA  origami,  which  typically  uses  a  fixed 
scaffold  sequence,  single-stranded  tiles  (SSTs)  give  us  the 
freedom  to  independently  vary  the  sequence  of  any  tile  in  the 
structure.  We  use  this  freedom  to  program  the  strength  of 
strand-strand  interactions,  thereby  shifting  the  optimal 
assembly  temperature  to  a  user-specified  temperature.  In 
other  words,  the  assembly  temperature  of  a  structure  is  now 
an  explicit  design  parameter.  This  will  allow  one  to  tune  the 
design  of  a  structure  to  fit  a  particular  application. 
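
To see why domain length and GC content act as tuning knobs, a rough rule of thumb is enough. The sketch below uses the Wallace rule for short oligonucleotides (2 °C per A/T and 4 °C per G/C), which is a textbook approximation and not the model used in this work; it shows only the direction of the trend and does not predict the assembly temperatures reported here. The example sequences are arbitrary.

```python
def wallace_tm_celsius(seq: str) -> int:
    """Crude duplex melting estimate: 2 °C per A/T plus 4 °C per G/C."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if at + gc != len(seq):
        raise ValueError("sequence must contain only A, C, G, T")
    return 2 * at + 4 * gc


# Longer or more GC-rich domains bind more strongly in this estimate.
for domain in ("ATATTAAT", "ATGCTAGC", "GCGCCGGC", "ATGCTAGCATGC"):
    print(f"{domain:<14} {len(domain):>2} nt  ~{wallace_tm_celsius(domain)} °C")
```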

We report the successful isothermal assembly of modified
SST structures across a wide range of temperatures. Building
upon our previous SST work,5'13' 4,36 we test 11 compact 2D
SST structures (Figure 1a) and 17 flexible variants with
single-stranded polyT linkers between the double-stranded domains
(Figure 1b). 6 Instead of the traditional thermal annealing
protocol (Figure 1c), all structures were assembled isothermally
(Figure 1d). Our approach therefore represents a general
method for isothermally assembling complex nanostructures
across a wide range of specified temperatures. We generally find
higher assembly temperatures for structures with longer
domains, and lower temperatures for those with linkers. We
also show that the assembly temperature can be modulated by
changing the strand–strand binding energy in several ways,
such as altering the GC content of binding domains or disrupting
the continuity of complementary segments.

Figure 1. Self-assembly of DNA tiles under diverse and biocompatible
conditions. (a,b) We use standard single-stranded tiles (SST; a) and
SST tiles containing single-stranded linker regions between domains
(b) to assemble 2D rectangular structures. A canonical SST has four
domains that bind to four of its nearest neighbors. Each colored
segment depicts an individual domain (unique in sequence and
between 8 and 21 nt long in our study), a gray solid connection
delineates the boundary between two domains, and a gray dotted line
depicts a single-stranded linker region (between 1 and 16 nt in our
study). Each complete structure (shown on the right) consists of 66
unique SSTs with edges protected by polyT segments to avoid
aggregation (light gray segments). (c) A schematic depicting the
traditional thermal annealing protocol for structure assembly, where
temperature is generally ramped above 60 °C and slowly decreased
over at least 12 h. (d) A schematic depicting our isothermal assembly
protocol across a wide range of fixed temperatures and conditions for
up to 12 h.

Finally, we report successful assembly of designed DNA
nanostructures under biocompatible conditions, that is, at
physiological temperature and pH, low salinity, and in the
presence of molecular crowding agents. Flexible structures,
which assemble at a wider range of conditions than their
nonflexible counterparts, assemble well under biocompatible
conditions. This represents a significant step toward the in vivo
assembly of geometrically precise nucleic acid nanostructures.
More generally, this approach may enable additional biological
applications of nanostructures in cases where in situ delivery of
preassembled structures is not an option.

In our single-stranded tile (SST) system, each strand consists
of four binding domains, with or without a single-stranded
linker region between consecutive domains. In the presence of
divalent cations, each strand is bound to its four neighbors via
complementary domains, resulting in a self-assembled 2D SST
structure.13 Most of our structures are derived from motifs in
our recent work36 and consist of 66 SSTs that form a 2D
rectangular shape. Two categories of motifs are used. The
motifs without linkers form 2D rectangles composed of 10
parallel helices (with length ranging from 108 to 252 nt)
connected by periodic single-stranded crossovers. Motifs with
polyT linkers form flexible rectangular, fish-net patterns
containing short segments of DNA helices connected by
single-stranded linkers at all junctions. We varied domain
length, linker length, and other design parameters. For an
overview of the system, see Supporting Information Figure S1
and Text S1.

SST structures can isothermally assemble across a wide range
of temperatures (Figure 2). We tested a set of U-shaped
motifs13,36 to form 2D rectangular structures. By altering the
domain lengths of the SST motifs from 8 to 21 nt, we were able
to achieve isothermal assembly across a 54 °C range (from 15
to 69 °C). Structure formation was assayed by gel electrophoresis
and atomic force microscopy (AFM). A dominant
product band was observed after isothermal assembly for 12 h
at the indicated temperatures in the presence of 10 mM Mg2+.
The formation ranges indicated on the temperature line are a
subset of the temperature ranges over which a dominant
product band was observed on the gel (see Supporting
Information Figure S2 for full range and raw gel data). In
some cases, we observed the presence of high molecular weight
bands or smears, presumably due to structure aggregation.

Imaging of the unpurified reaction or of the gel-purified
product  (indicated  with  an  asterisk)  revealed  the  formation  of 
structures  with  the  designed  shapes.  Structures  with  polyT 
linkers  between  domains  were  imaged  after  the  addition  of 
complementary  polyA  strands.  They  exhibited  a  wide  range  of 
conformations  when  imaged  due  to  the  flexibility  inherent  in 
the  linker  regions. 

The assembly yield was quantified using native gel
electrophoresis (Figure 3a). We assembled structures isothermally
for 1 h (domain length <16 nt) or 12 h (domain
length >16 nt) at a range of temperatures and then quantified
the gel yield in each lane (which corresponds to a particular,
fixed assembly temperature) using the TotalLab Quant gel
quantification software (Figure 3b). Here, the yield is defined as
the intensity of the product band divided by the
intensity of the entire lane after proper background correction.

Figure 2. SST-based DNA nanostructures can assemble isothermally across a wide range of temperatures. Representative structures that assemble
isothermally at different temperatures are shown and collectively cover the entire temperature range from 15 °C (blue) to 69 °C (red). Structures are
ordered and displayed close to their optimal formation temperatures. For each structure, strand diagrams (outside), AFM images (middle), and gel
electrophoresis results (inside) are shown. A tagged bar indicates part of the range of successful assembly temperatures shown by agarose gel
electrophoresis (see Supporting Information Figure S2 for full gel images). The leftmost three structures contain 10T linkers between domains; the
other structures do not, as shown in the strand diagrams. Numbers in strand diagrams indicate domain lengths (unit: nt). AFM images show the
typical morphology at the indicated temperatures; images with an asterisk contain gel-purified structures. Structures were assembled isothermally for
12 h using 200 nM of each strand and 10 mM Mg2+.

Figure 3. Quantification of optimal assembly temperature and effect of domain and linker lengths. Structures were assembled isothermally for 1 h
across a temperature range using 200 nM of each strand and 10 mM Mg2+ and assayed using gel electrophoresis. (a) Gel images were quantified
using TotalLab Quant software. Gel yield was defined as the integrated intensity of the formation band (green) divided by the total intensity of the
lane (red + green + blue). (b) Gel yield of three representative structures (L = 0, 2, 10) and Gaussian fits (solid lines) are shown. (c,d) Formation
yields as a function of temperature are shown for structures with varying domain lengths (D = 8–21; c) and linker lengths (L = 0–16; d). Each circle
represents a single yield quantification, with its radius proportional to the measured yield; colors indicate linker length from no linker (L = 0; green)
to 16T linker (L = 16; dark orange).


This is only an approximation of the true formation yield13 but
is useful for comparing relative yields for assembly under
varying conditions. The yield curves were then fit to a Gaussian
function with a constant background using custom MATLAB
software (Figure 3b). The yield as a function of temperature is

Figure 4. Optimal assembly temperatures can be rationally designed with linker domains and other methods. (a) Shifting to lower optimal assembly
temperatures (blue) by addition of linkers (dashed line: physiological temperature). Strand diagrams of four representative designs with distinct
strand geometry are shown. (b) Rational design of optimal assembly temperature (blue, lower; red, higher) with three other methods: altering the
GC content of domains to 30% (low GC) or 70% (high GC), splitting domains in half, and changing domain lengths. In the case of splitting
domains, a single T nucleotide is inserted in the middle of each domain (blue dots) to split each strand into a 5-T-5-10T-6-T-5-10T-5-T-5-10T-5-T-6
sequence pattern. In both panels, numbers in strand diagrams indicate domain lengths (unit: nt), tagged bars delimit the full width at half-maximum
(fwhm) of a Gaussian fit to the yield as a function of assembly temperature, and a circle denotes the optimal temperature (see Supporting
Information Figure S4 for raw gel data).

Figure 5. Assembly of structures under biocompatible conditions. (a–d) Effects of assembly conditions on formation yield of two representative
structures (olive, 2T linker; orange, 10T linker; see strand diagrams). Structures were assembled for 1 h at 37 °C using 200 nM of each strand. Yields
are calculated as in Figure 3a. Error bars indicate the standard deviation based on >3 replicate experiments. (a–c) Effects of pH, [Mg2+], and
[PEG-8000] on formation yield. Each parameter is varied alone while keeping the others at default values (0.5× TE buffer, pH 8, 10 mM Mg2+, no
PEG-8000). HEPES buffer was used for the pH 7 condition. (d) Combined effect of low salinity (1 mM Mg2+, 100 mM NaCl) and [PEG-8000]. (e) A
schematic of the crowding model: formation takes place at in vitro (no crowding, high salinity) or biocompatible (crowding, low salinity) conditions.
Orange: component strands. Dark green: magnesium ions. (f) Gel result for the 10T linker structure at the four conditions shown in (e): with or
without PEG-8000 (−, 0 mM; +, 20 mM), with low and high salinity (L, 1 mM Mg2+, 100 mM Na+; H, 10 mM Mg2+). (g) A simple mathematical
model of crowding, based on effective concentrations (see Supporting Information Text S3 for details), predicts assembly under low [Mg2+] and high
[PEG-8000] conditions.


well-fit  by  a  Gaussian  function,  indicating  that  our  structures 
assemble  optimally  at  a  specific  isothermal  temperature. 
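
The gel yield defined above (product band intensity divided by total lane intensity after background correction) can be illustrated with a minimal Python sketch. This is not the TotalLab Quant procedure; the function name `gel_yield`, the constant-background subtraction, and the toy lane profile are all assumptions made for illustration.

```python
from typing import Sequence, Tuple


def gel_yield(lane_profile: Sequence[float],
              band: Tuple[int, int],
              background: float = 0.0) -> float:
    """Fraction of background-corrected lane intensity inside the product band.

    lane_profile: intensity at each pixel row along one gel lane.
    band: (start, stop) row indices of the product band, stop exclusive.
    background: constant per-row level to subtract (a simplifying assumption;
        real gel software may use more elaborate corrections).
    """
    start, stop = band
    if not 0 <= start < stop <= len(lane_profile):
        raise ValueError("band must lie within the lane profile")
    # Subtract the background and clip at zero so noise cannot go negative.
    corrected = [max(value - background, 0.0) for value in lane_profile]
    total = sum(corrected)
    if total == 0:
        raise ValueError("lane has no signal above background")
    return sum(corrected[start:stop]) / total


# Hypothetical lane: faint aggregate smear, product band, unincorporated strands.
lane = [12, 14, 10, 60, 110, 60, 10, 30, 34, 10]
print(gel_yield(lane, band=(3, 6), background=10.0))  # 0.8
```

As the text notes, a ratio of this kind only approximates the true formation yield, but it is convenient for comparing lanes run under different assembly conditions.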

We observed a dependence of the assembly temperature
on the domain length (Figure 3c; see Supporting Information
Figure S3a for raw gel data). The Gaussian fits obtained from
curve fitting to the gel quantification data allowed us to obtain a
mean and a full width at half-maximum (fwhm) for each
structure that define an optimal temperature and an optimal
temperature range, respectively. The formation range is
indicated by the extent of the scatterplot, and the relative
yields at each temperature are indicated by the diameter of the
marker. We examined two sets of structures, those with no
linker between domains (green) and those with a 10T linker
between domains (orange). In both cases, we observed that the
formation range was a monotonically increasing function of the
domain length with diminishing returns at higher temperatures.
It should be noted that, since all of our domains have a similar
GC content (around 50%), domain length is a proxy for the
strength of strand–strand interactions in these experiments.
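
The relationship between the Gaussian fit parameters and the reported optimal temperature and temperature range can be sketched as follows. The original fits were performed with custom MATLAB software that is not reproduced here; this Python sketch only evaluates a Gaussian with a constant background and converts its standard deviation to a fwhm. The function names and the example parameter values are hypothetical.

```python
import math


def gaussian_yield(temp_c, amplitude, optimum_c, sigma_c, background=0.0):
    """Gaussian yield curve with a constant background."""
    z = (temp_c - optimum_c) / sigma_c
    return background + amplitude * math.exp(-0.5 * z * z)


def fwhm(sigma_c):
    """Full width at half-maximum of a Gaussian with standard deviation sigma_c."""
    return 2.0 * math.sqrt(2.0 * math.log(2.0)) * sigma_c


def optimal_range(optimum_c, sigma_c):
    """Temperature window spanned by the fwhm, centred on the fitted mean."""
    half = fwhm(sigma_c) / 2.0
    return optimum_c - half, optimum_c + half


# Hypothetical fit parameters, chosen only for illustration.
low, high = optimal_range(optimum_c=45.0, sigma_c=4.0)
print(f"optimal range: {low:.1f} to {high:.1f} degC")
```

At either end of the window returned by `optimal_range`, the background-subtracted yield is half of its peak value, which is the sense in which the tagged bars in Figure 4 delimit the fwhm.
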

Linkers between domains caused a ~10 °C downward shift
in the assembly temperature (Figure 3d; see Supporting
Information Figure S3b for raw gel data). Even a single T
nucleotide between domains was sufficient to shift the assembly
temperature; additional T nucleotides did not contribute to the
temperature shift. As in Figure 3c, we show the formation range
using a scatterplot, where the relative yield at each temperature
is indicated by the diameter of the marker. The effect of the
linkers did not depend on the polyT sequence; a linker
consisting of 10 A nucleotides caused a similar downward shift
in assembly temperature (see Supporting Information Figure
S3c).

We  redesigned  many  structures  to  assemble  at  the 
physiological  temperature  using  polyT  linkers  (Figure  4a;  see 
Supporting  Information  Figure  S4a  for  raw  gel  data).  As  shown, 
the  effect  of  polyT  linkers  was  similar  for  several  different 
strand  geometries,  causing  a  downward  shift  of  the  assembly 
temperature of 7–10 °C. In all cases, the redesigned structures
formed well at the physiological temperature (37 °C),
indicating the generality of this approach for programming
the assembly temperature of DNA nanostructures. Furthermore,
we demonstrated several additional strategies to shift the
assembly temperature (Figure 4b; see Supporting Information
Figure S4b for raw gel data): altering the strength of strand-strand
interactions via the GC content, changing the length of
the complementary domains within a structure, or splitting
domains in half by inserting a single T nucleotide. Thus, there
exists a wide variety of ways to program the assembly
temperature of a DNA nanostructure. This should enable
structures  with  varying  geometries  and  morphologies  that 
assemble  at  desired  temperatures. 

Flexible  SST  DNA  structures  assembled  under  biocompatible 
conditions.  A  number  of  structure  designs  assembled  well  at  the 
physiological  temperature  (37  °C;  Figures  2,  3,  and  4). 
However,  there  exist  other  environmental  variables,  such  as 
pH,  salinity,  and  molecular  crowding,  which  must  be  accounted 
for  in  order  for  structure  assembly  to  be  biocompatible.  We 
tested  two  structure  designs  which  assembled  well  at  37  °C  in  1 
h  (Figure  3b),  one  with  a  2T  linker  (olive)  and  one  with  a  10T 
linker  (orange)  by  systematically  varying  other  environmental 
variables  in  vitro  (Figure  5;  see  Supporting  Information  Figure 
S5 for raw gel data). Yields are determined as in Figure 3a.

Physiological  pH  is  compatible  with  structure  assembly 
(Figure  5a).  We  varied  the  pH  of  the  buffer  in  which  the 
structures  were  assembled  from  7  to  9,  which  encompassed  the 
physiological  pH  range.  Both  structures  assembled  well  across 
this  pH  range,  indicating  that  the  physiological  pH  should  not 
prevent  SST  structures  from  self-assembling. 

Low salinity hindered structure formation (Figure 5b). DNA
nanostructures are typically assembled in vitro with a Mg2+
concentration in excess of 10 mM.4,6,11,13 Recently, it was
shown that sodium ions can also be substituted for
magnesium ions in some cases.38 However, the free
concentration of magnesium equivalents in the E. coli
cytoplasm is approximately 2–4 mM (1–2 mM Mg2+, 180–200
mM Na+ or K+).39,40 Our flexible SST designs assembled
close to optimally at as low as 6 mM Mg2+ (2T linker, olive) or
4 mM Mg2+ (10T linker, orange), approaching but not quite
reaching the biologically relevant salinity.

Crowded conditions decreased the assembly yield (Figure
5c). A crowded environment was simulated using polyethylene
glycol with an average molecular weight of 8000 Da (PEG-8000).
To test the effect of crowding on structure assembly, the
PEG-8000 concentration was varied from 0 to 23.5 mM. A
Mg2+ concentration of 10 mM was used in these experiments.
We observed a decline in yield for both structure designs with
increasing PEG-8000 concentrations, combined with the
appearance of a high molecular weight band indicative of
aggregation (see Supporting Information Figure S5c for raw gel
data). These results indicate that a combination of crowding and
high magnesium decreases assembly yield. Additionally, we
noted that the long linker design (orange) had a
consistently higher yield than the short linker design (olive)
under the conditions we tested (Figure 5a–d), perhaps due to
increased flexibility enhancing the assembly process (e.g., by
relieving the electrostatic repulsion between parallel DNA
helices).

Molecular  crowding  compensated  for  low  magnesium, 
thereby  enhancing  structure  assembly  (Figure  5d).  When 
considered  separately,  both  salinity  and  crowding  pose 
challenges  to  biocompatible  assembly  of  tile-based  DNA 
nanostructures.  However,  when  we  tested  the  two  conditions 
together by varying the PEG-8000 concentration from 0 to 20
mM with low salinity (1 mM Mg2+, 100 mM Na+) at pH 7.5,
we  observed  that  increased  crowding  actually  improved  the 
assembly  yield.  This  effect  was  seen  for  both  short  (olive)  and 
long  linker  designs  (orange).  Together,  these  results  indicate 
that  SST  DNA  nanostructures  can  be  designed  to  assemble 
under  biocompatible  conditions. 

We  conjecture  that  molecular  crowding  enhances  structure 
assembly  by  increasing  the  effective  magnesium  concentration 
(Figure  5e).  In  the  absence  of  crowding,  high  salinity  is 
required  for  assembly  (top  left),  but  in  the  presence  of 
crowding  high  salinity  inhibits  assembly.  This  model  is 
consistent  with  our  experimental  results;  structure  assembly 
(product  band)  is  observed  under  high  salinity  with  no 
crowding  and  low  salinity  with  crowding  but  not  under  low 
salinity  with  no  crowding  or  under  high  salinity  with  crowding 
(Figure 5f). We used this insight to construct a quantitative
model  of  the  effect  of  molecular  crowding  on  structure 
assembly  and  predict  a  range  of  conditions  under  which 
assembly  is  likely  to  occur,  indicated  by  the  shaded  regions 
(Figure  5g;  see  Supporting  Information  Text  S3  and  Figure  S6 
for  details). 
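
The qualitative logic of an effective-concentration picture can be sketched in a few lines of Python. This is not the model of Supporting Information Text S3, which is not reproduced here; it is a toy stand-in. The excluded-volume factor, the conversion of monovalent ions to magnesium equivalents (roughly 100 mM per 1 mM Mg2+, read loosely from the cytoplasmic figures quoted above), and the assembly window of 4 to 30 mM are all hypothetical values chosen only so that the four conditions of Figure 5f come out in the observed pattern.

```python
def magnesium_equivalents(mg_mM, monovalent_mM, monovalent_per_mg=100.0):
    """Rough salinity in Mg2+ equivalents (the conversion factor is an assumption)."""
    return mg_mM + monovalent_mM / monovalent_per_mg


def effective_concentration(conc_mM, peg_mM, excluded_per_mM=0.0375):
    """Scale a concentration by the volume fraction the crowder leaves accessible."""
    accessible = 1.0 - excluded_per_mM * peg_mM  # hypothetical linear exclusion
    if accessible <= 0.0:
        raise ValueError("crowder excludes the entire volume in this toy model")
    return conc_mM / accessible


def predicts_assembly(mg_mM, monovalent_mM, peg_mM, window=(4.0, 30.0)):
    """True when the effective salinity falls inside a hypothetical assembly window."""
    effective = effective_concentration(
        magnesium_equivalents(mg_mM, monovalent_mM), peg_mM)
    return window[0] <= effective <= window[1]


# The four conditions of Figure 5f: (Mg2+ mM, Na+ mM, PEG-8000 mM).
conditions = {
    "low salinity, no PEG": (1.0, 100.0, 0.0),
    "high salinity, no PEG": (10.0, 0.0, 0.0),
    "low salinity, 20 mM PEG": (1.0, 100.0, 20.0),
    "high salinity, 20 mM PEG": (10.0, 0.0, 20.0),
}
for name, args in conditions.items():
    print(name, predicts_assembly(*args))
```

With these illustrative parameters the sketch predicts assembly only for high salinity without PEG and for low salinity with PEG, mirroring the product bands described in the text; different parameter choices would shift the predicted window.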

SST  structures  can  be  tuned  to  assemble  isothermally  across 
a range of temperatures from 15 to 69 °C. Additionally, we
demonstrated four different ways to modulate the assembly
temperature of a structure: changing domain length, altering
domain GC content, adding a linker between domains, or
splitting a domain in half. This has enabled us to design SST
structures that can assemble well under biocompatible
conditions. More generally, we now have the capability to
design structures that can assemble under a diverse range of
specified conditions. We noticed that structures tend to
assemble with lower yield at extreme temperatures or
conditions. However, the yield at a particular condition, or
the  range  of  assembly  conditions,  can  be  increased  by  extending 
the  reaction  time.  Together,  these  results  indicate  that  the 
isothermal  SST  assembly  process  is  remarkably  robust  to  the 
assembly  temperature  and  conditions. 

Our ability to assemble structures at room temperature can
enable the study of structure assembly in real time using atomic
force microscopy12,32 or super-resolution microscopy techniques;41
at the other end of the spectrum we can assemble
structures at temperatures above the typical melting temperature
of previous DNA nanostructures. More broadly, our
unique ability to control the assembly temperature of DNA
nanostructures allows structures to be tailored to in vivo or in
situ  applications  where  control  of  the  ambient  temperature  may 
not  be  feasible.  Our  success  in  biocompatible  assembly  is 
particularly  exciting  because  it  is  a  key  step  toward  in  vivo 
assembly  of  geometrically  precise  nanostructures  from  many 
individual DNA or RNA components. Furthermore, biocompatible
assembly takes us beyond the paradigm of annealing
structures  over  many  hours  under  precisely  controlled  reaction 
conditions.  Our  approach  of  rationally  modulating  the  assembly 
conditions  should  be  generalizable  to  other  types  of  structures 
with  a  few  potential  constraints.  One  such  constraint  is  that  the 
component  strands  should  not  contain  significant  secondary 
structures  that  hinder  their  interactions  with  other  strands.  For 
example,  the  secondary  structure  of  a  typical  M13  DNA  origami 
scaffold  strand  may  interfere  with  the  designed  structure 
formation  at  low  temperatures.  A  scaffold  with  designed 
sequence  could  be  used  to  overcome  this  constraint. 

Beyond tunable SST structures, control over the assembly
conditions could perhaps be further improved by exploiting
toe-hold-mediated42 strand displacement cascades,17,43 for
example, using the triggered isothermal assembly of reconfigurable
hairpins.18,35,44 Although the resulting hairpin-based
structures (e.g., branched junctions,18 polymers,35,44,46 or
dendrimers18) are not geometrically precise and typically
involve very few component strands, they can be formed
dynamically via the introduction of an initiator strand.
Additionally, a simple two-component toe-hold exchange
based system has been shown to be robust to temperature,
salinity, and concentration.46 Thus, by integrating the robustness of
toe-hold based systems with the tunability and geometrical
control of the SST systems described here, one may potentially
design self-assembly systems that could allow for assembly
across a wide range of temperatures and conditions without the
need for condition-specific tuning.

A rational next step after biocompatible assembly is to
assemble geometrically precise RNA nanostructures in vivo and
to use them to organize metabolic enzymes or other functional
biomolecules.12 One could build upon the knowledge and
experience from DNA SST assembly to design and rapidly
prototype RNA structures that assemble under biocompatible
conditions and act as scaffolds. With greater freedom in design
geometry and positioning of protein-binding sites, one would
be able to design and test interactions between multiple
functional molecules across varying distances and geometrical
patterns. One can thus survey a wider design space than has
been achieved with previous protein47,48 or RNA12 scaffolds.
DNA or RNA nanostructures could be delivered or expressed
as individual strands and assembled directly at the site of
interest. Taken together, our results present a new opportunity
to  assemble  precise,  programmable  structures  in  biological  and 
cellular  environments,  which  could  lead  to  exciting  in  vitro  and 
in  vivo  applications. 

■  ASSOCIATED  CONTENT 
Supporting Information

Supplementary  methods,  structure  naming  conventions,  notes 
about  experiments,  modeling  details,  and  six  supplementary 
figures.  This  material  is  available  free  of  charge  via  the  Internet 
at http://pubs.acs.org.


■  AUTHOR  INFORMATION 
Notes 

The  authors  declare  no  competing  financial  interest. 

■  ACKNOWLEDGMENTS 

We thank Bryan Wei, Dave Zhang, Erik Winfree, and Robert
Barish for discussion. This work was funded by a DARPA
Living Foundries grant HR0011-12-C-0061 to P.A.S. and P.Y.,
an Office of Naval Research (ONR) Young Investigator
Program award N000141110914 to P.Y., an ONR grant
N000141010827 to P.Y., a National Science Foundation
(NSF) Faculty Early Career Development Award 1054898 and
an NSF grant 1162459 to P.Y. C.M. acknowledges a graduate
research fellowship from the Fannie and John Hertz
Foundation. M.D. acknowledges an international student
research fellowship from the Howard Hughes Medical Institute.

■  REFERENCES 

(1) Yan, H.; Park, S. H.; Finkelstein, G.; Reif, J. H.; LaBean, T. H.
Science 2003, 301, 1882–4.

(2) Chworos, A.; Severcan, I.; Koyfman, A. Y.; Weinkam, P.;
Oroudjev, E.; Hansma, H. G.; Jaeger, L. Science 2004, 306, 2068–72.

(3) Rothemund, P. W. K.; Papadakis, M.; Winfree, E. PLoS Biol.
2004, 2, e424.

(4) Rothemund, P. W. K. Nature 2006, 440, 297–302.

(5) Yin, P.; Hariadi, R. F.; Sahu, S.; Choi, H. M. T.; Park, S. H.;
LaBean, T. H.; Reif, J. H. Science 2008, 321, 824–6.

(6) Douglas, S. M.; Dietz, H.; Liedl, T.; Högberg, B.; Graf, F.; Shih,
W. M. Nature 2009, 459, 414–8.

(7) Dietz, H.; Douglas, S. M.; Shih, W. M. Science 2009, 325, 725–30.

(8) Zheng, J.; Birktoft, J. J.; Chen, Y.; Wang, T.; Sha, R.;
Constantinou, P. E.; Ginell, S. L.; Mao, C.; Seeman, N. C. Nature
2009, 461, 74–7.

(9) Andersen, E. S.; Dong, M.; Nielsen, M. M.; Jahn, K.; Subramani,
R.; Mamdouh, W.; Golas, M. M.; Sander, B.; Stark, H.; Oliveira, C. L.
P.; Pedersen, J. S.; Birkedal, V.; Besenbacher, F.; Gothelf, K. V.; Kjems,
J. Nature 2009, 459, 73–6.

(10) Yang, H.; McLaughlin, C. K.; Aldaye, F. A.; Hamblin, G. D.;
Rys, A. Z.; Rouiller, I.; Sleiman, H. F. Nature Chem. 2009, 1, 390–6.

(11) Han, D.; Pal, S.; Nangreave, J.; Deng, Z.; Liu, Y.; Yan, H. Science
2011, 332, 342–6.

(12) Delebecque, C. J.; Lindner, A. B.; Silver, P. A.; Aldaye, F. A.
Science 2011, 333, 470–4.

(13) Wei, B.; Dai, M.; Yin, P. Nature 2012, 485, 623–6.

(14) Ke, Y.; Ong, L. L.; Shih, W. M.; Yin, P. Science 2012, 338,
1177–1183.

(15) Sherman, W. B.; Seeman, N. C. Nano Lett. 2004, 4, 1203–1207.

(16) Yin, P.; Yan, H.; Daniell, X. G.; Turberfield, A. J.; Reif, J. H.
Angew. Chem., Int. Ed. 2004, 43, 4906–11.

(17) Zhang, D. Y.; Turberfield, A. J.; Yurke, B.; Winfree, E. Science
2007, 318, 1121–5.

(18) Yin, P.; Choi, H. M. T.; Calvert, C. R.; Pierce, N. A. Nature 2008,
451, 318–22.

(19) Omabegho, T.; Sha, R.; Seeman, N. C. Science 2009, 324, 67–71.

(20) Lund, K.; Manzo, A. J.; Dabby, N.; Michelotti, N.; Johnson-Buck,
A.; Nangreave, J.; Taylor, S.; Pei, R.; Stojanovic, M. N.; Walter,
N. G.; Winfree, E.; Yan, H. Nature 2010, 465, 206–10.

(21) Qian, L.; Winfree, E. Science 2011, 332, 1196–201.

(22) Wickham, S. F. J.; Bath, J.; Katsuda, Y.; Endo, M.; Hidaka, K.;
Sugiyama, H.; Turberfield, A. J. Nat. Nanotechnol. 2012, 7, 169–73.

(23) Douglas, S. M.; Bachelet, I.; Church, G. M. Science 2012, 335,
831–4.

(24) Derr, N. D.; Goodman, B. S.; Jungmann, R.; Leschziner, A. E.;
Shih, W. M.; Reck-Peterson, S. L. Science 2012, 338, 662–5.




(25) Acuna, G. P.; Moller, F. M.; Holzmeister, P.; Beater, S.; Lalkens,
B.; Tinnefeld, P. Science 2012, 338, 506–10.

(26) Lin, C.; Jungmann, R.; Leifer, A. M.; Li, C.; Levner, D.; Church,
G. M.; Shih, W. M.; Yin, P. Nature Chem. 2012, 4, 832–9.

(27) Maune, H. T.; Han, S.-P.; Barish, R. D.; Bockrath, M.; Iii, W. A.
G.; Rothemund, P. W. K.; Winfree, E. Nat. Nanotechnol. 2010, 5, 61–6.

(28) Douglas, S. M.; Chou, J. J.; Shih, W. M. Proc. Natl. Acad. Sci.
U.S.A. 2007, 104, 6644–8.

(29) Lee, H.; Lytton-Jean, A. K. R.; Chen, Y.; Love, K. T.; Park, A. I.;
Karagiannis, E. D.; Sehgal, A.; Querbes, W.; Zurenko, C. S.; Jayaraman,
M.; Peng, C. G.; Charisse, K.; Borodovsky, A.; Manoharan, M.;
Donahoe, J. S.; Truelove, J.; Nahrendorf, M.; Langer, R.; Anderson, D.
G. Nat. Nanotechnol. 2012, 7, 389–93.

(30) Liu, X.; Xu, Y.; Yu, T.; Clifford, C.; Liu, Y.; Yan, H.; Chang, Y.
Nano Lett. 2012, 12, 4254–9.

(31) Jungmann, R.; Liedl, T.; Sobey, T. L.; Shih, W.; Simmel, F. C. J.
Am. Chem. Soc. 2008, 130, 10062–3.

(32) Evans, C. G.; Hariadi, R. F.; Winfree, E. J. Am. Chem. Soc. 2012,
134, 10485–92.

(33) Schulman, R.; Yurke, B.; Winfree, E. Proc. Natl. Acad. Sci. U.S.A.
2012, 109, 6405–10.

(34) Sobczak, J. J.; Martin, T. G.; Gerling, T.; Dietz, H. Science 2012,
338, 1458–61.

(35) Dirks, R. M.; Pierce, N. A. Proc. Natl. Acad. Sci. U.S.A. 2004, 101,
15275–8.

(36) Wei, B.; Dai, M.; Myhrvold, C.; Ke, Y.; Jungmann, R.; Yin, P. J.
Am. Chem. Soc., in press.

(37) Winfree, E.; Liu, F.; Wenzler, L. A.; Seeman, N. C. Nature 1998,
394, 539–44.

(38) Martin, T. G.; Dietz, H. Nat. Commun. 2012, 3, 1103.

(39) Alatossava, T.; Jütte, H.; Kuhn, A.; Kellenberger, E. J. Bacteriol.
1985, 162, 413–9.

(40) Shabala, L.; Bowman, J.; Brown, J.; Ross, T.; McMeekin, T.;
Shabala, S. Environ. Microbiol. 2009, 11, 137–48.

(41) Jungmann, R.; Steinhauer, C.; Scheible, M.; Kuzyk, A.;
Tinnefeld, P.; Simmel, F. C. Nano Lett. 2010, 10, 4756–61.

(42) Yurke, B.; Turberfield, A. J.; Mills, A. P., Jr.; Simmel, F. C.;
Neumann, J. L. Nature 2000, 406, 605–608.

(43) Seelig, G.; Soloveichik, D.; Zhang, D. Y.; Winfree, E. Science
(New York, N.Y.) 2006, 314, 1585–8.

(44) Choi, H. M. T.; Chang, J. Y.; Trinh, L. A.; Padilla, J. E.; Fraser, S.
E.; Pierce, N. A. Nat. Biotechnol. 2010, 28, 1208–12.

(45) Venkataraman, S.; Dirks, R. M.; Rothemund, P. W. K.; Winfree,
E.; Pierce, N. A. Nat. Nanotechnol. 2007, 2, 490–4.

(46) Zhang, D. Y.; Chen, S. X.; Yin, P. Nature Chem. 2012, 4, 208–14.

(47) Dueber, J. E.; Wu, G. C.; Malmirchegini, G. R.; Moon, T. S.;
Petzold, C. J.; Ullal, A. V.; Prather, K. L. J.; Keasling, J. D. Nat.
Biotechnol. 2009, 27, 753–9.

(48) Whitaker, W. R.; Davis, S. A.; Arkin, A. P.; Dueber, J. E. Proc.
Natl. Acad. Sci. U.S.A. 2012, 109, 18090–5.


dx.doi.org/10.1021/nl4019512 | Nano Lett. 2013, 13, 4242–4248


